Gene Francci3_2212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2212 
Symbol 
ID3906351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2587344 
End bp2588741 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content71% 
IMG OID637879544 
Producthypothetical protein 
Protein accessionYP_481310 
Protein GI86740910 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.438788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGAAA CGGGTCACGC GCGGGAAACG GGTCATGTCC AGGAAGAAGG CCAGGAAGGA 
GGCCACGCGC CGCGGAGACG TCACGCGCGG CTGGTAGGTC ACGGGCAGCA GGACATCGTG
AGGAGGGGAC GGCGCTCCCG GCGGCGCCGG ACCGTCATCG GGCTCGTCCT CGCCGCGACG
ACCGCGGCGA CGGCCGCCGC GACGACCAGC GCCCTCATCG ACCCCGCCGC GGCCACCGGG
CCGACCGCGG CTTGGCACCA CACATCCTCC TACCGCGGGG ATGTGGTCTC CGTCAAACGG
CTACGGGTCC TGTCGGCGGA GTATGTGCGG GCGGTCCTGG CCAGTACCGG GTTCGACGCC
GATCCGGAGG TGATCCGGTC GGGGGTGCGG ACCTACCGGG TCGTGTACTA CACGATCGAC
CCCCGTGGAC GTCGGACGAC CGCCAGCGGC CTGGTGGCGC TGCCGCGCAC CGAGGACCGC
TGGCTGCAGG TCGTCTCCTA CGCCCACGGC ACCGAGATCA ACCGCGCGGA CGCCCCGTCG
ACCGCGACCG ATCTGGAGGC CAACTCCTGG GGGCAGGCCC CGGCCCTCAC CTACGCCGCC
GCCGGGTTCG CTGCGGTCGC CCCCGACTAC CTGGGCTACG GCGTCGGGCC GGGTGACCAT
CCGTGGTTGG ATGTCCCCTC GGAAACCAGC GCGTCGCTGG ACCTGCTCCG TGCGGCGCGC
ACCGTCGCCG CGCGTGAAGG CCGGAAGTTC AAACGGGATG TGCTGGTCAC CGGGTTCTCC
CAGGGCGCTT CGGCGGCCAC GGGACTGGGG AAGGCACTGT GGGCCGGCGC CGACCCGTCG
TTCCGGCTCG GTGCCCTGGC GCCCGTCAGC GGAGCCTATG ACCTGCGTTC CGTGGAGCTG
CCGGCATGGC TGAACGGGCA GGTCGCATGG CCCTTCGGTG TCGGCTACAC CGCCTTCTTC
CTCGTCTCCT GGAACCGCCT GTACCACCTG TACCGCTCGC CCGCGGAGGT GTTCCAGGAT
CCGAAGGTCG CGACGCTCTT CGACGGCAAG CACACCGGGG AGCAGATCCT CGCCGGCCTG
CCGGCGACCA CCGGCAAGCT GCTCACCCCG CACGGCTTCG ACCTGCTCCG CAACCCCTGG
GACGGGCTGG CCGTGGGGCT GTGGGCGGCG GACCATACCT GTTCGAACTG GAACCCGAGG
GCGCCGGTCC GGCTCTTCGT CGGCGGCGCC GACCATCAGG TTCCGCGGGC CAACAGCGAG
CATTGCCAGG CCGCGTTCCT GGCACGTGGC GCCGACGCCC CGATCATCAA CCTGGGACCC
AACGTCGGTC ATCTCGACTC CAACAAGGCC GGTACGGCCG CGGTCGTCCG CTGGTTCCGG
CAACAGGTGC AGCAGTAG
 
Protein sequence
MRETGHARET GHVQEEGQEG GHAPRRRHAR LVGHGQQDIV RRGRRSRRRR TVIGLVLAAT 
TAATAAATTS ALIDPAAATG PTAAWHHTSS YRGDVVSVKR LRVLSAEYVR AVLASTGFDA
DPEVIRSGVR TYRVVYYTID PRGRRTTASG LVALPRTEDR WLQVVSYAHG TEINRADAPS
TATDLEANSW GQAPALTYAA AGFAAVAPDY LGYGVGPGDH PWLDVPSETS ASLDLLRAAR
TVAAREGRKF KRDVLVTGFS QGASAATGLG KALWAGADPS FRLGALAPVS GAYDLRSVEL
PAWLNGQVAW PFGVGYTAFF LVSWNRLYHL YRSPAEVFQD PKVATLFDGK HTGEQILAGL
PATTGKLLTP HGFDLLRNPW DGLAVGLWAA DHTCSNWNPR APVRLFVGGA DHQVPRANSE
HCQAAFLARG ADAPIINLGP NVGHLDSNKA GTAAVVRWFR QQVQQ