Gene Francci3_1438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1438 
Symbol 
ID3903169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1731165 
End bp1732301 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content73% 
IMG OID637878775 
Productbifunctional RNase H/acid phosphatase 
Protein accessionYP_480544 
Protein GI86740144 
COG category[L] Replication, recombination and repair
[G] Carbohydrate transport and metabolism 
COG ID[COG0328] Ribonuclease HI
[COG0406] Fructose-2,6-bisphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.246631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0428381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACCG TGCGCAGGCT GGTCATCGAG GCCGACGGCG GGTCGCGGGG TAACCCGGGC 
CCGGCGGGCT ACGGGGCCGT GGTCCGCGAC GCCGGCTCGG GGAAGCTGCT GGCCGAGCGC
GCGGACTCGA TCGGGCGCGC CACGAACAAC GTCGCCGAGT ACTCCGGGCT GATCGCCGGG
CTGCGGGCCG CCGCTGAGAT CGCCCCGGAC GCCGAGCTGG AGGTCCGGAT GGACTCCAAG
CTCGTCGTCG AGCAGATGAG CGGCCGGTGG AAGGTCAAGC ATCCGGCGAT GCGGCCGCTG
GTGGCCGAGG CGACGGAACT CGCCGCCCGG TTCCCCGCGG TCCGTTTCCA GTGGGTGCCC
CGCGCGCGCA ACGCCGACGC CGATCGGCTC GCGAACGAGG CGATGGACGC CGCCGCCGCC
GGGCGGGCCT GGGAACCCGC GGTCCCGCAG TCCCCGGACC CGGTCCCGCA CCACGCGCCG
ACGACGAACC GGCTCTCCGG CTGGATGGCG CCGCCCGCAC CGCCGACGAC GACCGTGCTG
CTGCGTCACG GGCAGACCCC GCTCTCGGTG GAGAAGCGCT TCAGCGGGAC GGTCGAGGCG
GCCCTGACCG ACGTCGGGCT CGGCCAGGCC GCGGCCGTAG CGAACCGGCT GCGCGACGAG
CCGTTCGATC TCGTCGTCAG TTCGCCGCTG AAGCGGGCCC GGCAGACCGC CGAGGCCCTC
GGCCGCGACT ACGTCGTCGT CGACGATTTG CGGGAGACCG ACTTCGGCGC CTGGGAGGGG
CTGACCTTCG CCGAGGTGCG TGAGAAGTTC CCCGATGAGC TCAATGCCTG GCTCGCCGAC
CCGCACGTCC CGCCGCCCGG CGGGGAGAGC CTGATCGCCA CCGTGGCCCG GGTAGCGCGG
GTGCGTGACC GGCTGCTCGC CGAGCAGCCA GGCGGGCGGG TCCTCATCGT CTCACACGTC
ACGCCGATCA AGGGGTTGGC CCAGCTGGCG CTGGCGGCGG AACCGGCCGT GCTCTACCGC
CTCCACCTGG ATCTGGTGTC GATGACGACC ATCGACTGGT ACTCCGACGG CCCGGCGGTG
CTACGCGGGT TCAACGACAC CCATCACCTG GCGGGCCAGA CGATCTACGG CGAATAG
 
Protein sequence
MGTVRRLVIE ADGGSRGNPG PAGYGAVVRD AGSGKLLAER ADSIGRATNN VAEYSGLIAG 
LRAAAEIAPD AELEVRMDSK LVVEQMSGRW KVKHPAMRPL VAEATELAAR FPAVRFQWVP
RARNADADRL ANEAMDAAAA GRAWEPAVPQ SPDPVPHHAP TTNRLSGWMA PPAPPTTTVL
LRHGQTPLSV EKRFSGTVEA ALTDVGLGQA AAVANRLRDE PFDLVVSSPL KRARQTAEAL
GRDYVVVDDL RETDFGAWEG LTFAEVREKF PDELNAWLAD PHVPPPGGES LIATVARVAR
VRDRLLAEQP GGRVLIVSHV TPIKGLAQLA LAAEPAVLYR LHLDLVSMTT IDWYSDGPAV
LRGFNDTHHL AGQTIYGE