Gene Francci3_3106 details


Gene Information

Locus tag: Francci3_3106
Symbol:
ID: 3904232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3679321
End bp: 3680643
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 71%
IMG OID: 637880427
Product: Rieske (2Fe-2S) protein
Protein accession: YP_482192
Protein GI: 86741792
COG category: [C] Energy production and conversion
COG ID: [COG0723] Rieske Fe-S protein
TIGRFAM ID:
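
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one stop codon; both conventions are assumptions, although they agree with the values shown. The function names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the record above.
start_bp = 3679321
end_bp = 3680643
gene_length_bp = 1323
protein_length_aa = 440


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

The 1323 bp span corresponds to 441 codons, which is 440 residues plus the terminal stop codon.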


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.123107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.358682
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACC AGACGGGCCG GGGCACGCCG GGGCCGGCGG ATTCCGGTGC GGACCAGCCG 
TCCGTTCCGA CGGACCGGCG CATGCACTAC CCGGGCTACC AGGGCGAACC GGCCGACGAA
TCTGACCAGG ACTCGATGAG CAGCATCTTC AGAAGGGCGA AACACCCGGT GGACACCTCC
GGCCAGCCCG CCGGGGGATC CGGGGTGTCC GCCGGCGGCC CGGACGTCCC GGCCGGGACG
TCGGACGGCC CGGAGGCCTT GGGCACGCCG CCCCACCAGA CCGATTCGAC CCGTGAGCCC
GCGACCACGG CCGTTCGCCC GCAGGCGGGG GTCATCCAGC CGACCTACGG GGAGCGGGCC
GCCGGTGGGG TGACCGTCCG TCCGCCGCGC GCGCAGGACG TGGACCGCCG GGCCGAGCGG
CGGGCGGAGC GGCTGATTGC CGGCTGGTTC CTGATCTCGG TGATCGGCAC CGTCGGGTTC
GTTGTCACCA ACTTCGTCGG TGACAAGTAC AAGGCGTACT ACACCCCCTC CCTCGGGGCC
GCGCTGGGCC TCGCCGTCGG TGGGCTCGGC ATCGGCCTGA TCCTGTGGGC CAAGCGGCTG
ATGCCGCACG AGCAGGCCGT GCAGGAGCGC CACGCGTTCG CCTCCTCCAA GGAGGAGATC
GCCGTCACCG AGGAGGAGAT CGCGGCCGGG TTCGCCGACA CCGGCCTCGC GCGCTACCCG
TTGCTGCGCC GCACCCTGCT CGGGGCGGGC ACCGTGCTCG GCGGGCTGAT CGTGGTGCCG
CTGCTGAACC TGACGAACAC CAAGCCGGGC AAGAAGCTCG ACCACACGAG CTGGGTGAAG
GGTGCCCGCC TCGTCACGGA GGACGGCCGT TACGTCAAGC TCGGCGACGT CGCCATCGGC
GGCATCGAGA CGGTGTTCCC GGCCGTTCCG GTGACCGAGT CCGACGGCAC CGTTGCGTAC
CAGCCGAAGA CCGATGTCCA GACCAAGGCC GACAGCACGA CGCTGCTCAT CCGGCTGCCA
CCGGGGGTCG ACCGCCCCCG CAAGGGCCGG GAGGACTGGG GCGTCGACGG CCACGTCGCC
TACTCCAAGA TCTGCACGCA CGCCGGCTGC CCGGTCAGCC TGTACGAACA GCAGACCCAT
CATCTGCTCT GCCCCTGCCA CCAGTCCGTC TTCGACGTGC GGGACGGCTG TCGGGCGATT
TTCGGGCCGG CCAGCCGGTC CCTGCCACAG CTTGCCATCG CCGCGGACAA GGACGGGTTC
CTCTACGCCC GTGACGGTGA CTACCACGAG CCCGTCGGTG CCGCCTTCTG GGAGCGCTCA
TGA
 
Protein sequence
MSDQTGRGTP GPADSGADQP SVPTDRRMHY PGYQGEPADE SDQDSMSSIF RRAKHPVDTS 
GQPAGGSGVS AGGPDVPAGT SDGPEALGTP PHQTDSTREP ATTAVRPQAG VIQPTYGERA
AGGVTVRPPR AQDVDRRAER RAERLIAGWF LISVIGTVGF VVTNFVGDKY KAYYTPSLGA
ALGLAVGGLG IGLILWAKRL MPHEQAVQER HAFASSKEEI AVTEEEIAAG FADTGLARYP
LLRRTLLGAG TVLGGLIVVP LLNLTNTKPG KKLDHTSWVK GARLVTEDGR YVKLGDVAIG
GIETVFPAVP VTESDGTVAY QPKTDVQTKA DSTTLLIRLP PGVDRPRKGR EDWGVDGHVA
YSKICTHAGC PVSLYEQQTH HLLCPCHQSV FDVRDGCRAI FGPASRSLPQ LAIAADKDGF
LYARDGDYHE PVGAAFWERS
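
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. The following is a minimal Python sketch, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in the set of permitted start codons. The helper names `clean`, `gc_percent` and `translate` are hypothetical, and the sketch assumes the input is an uppercase or lowercase DNA string containing only A, C, G and T plus whitespace.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Alternative start codons (for example GTG or TTG) are not converted
    to methionine here; this gene starts with ATG, so that case does not arise.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGAGTGACC AGACGGGCCG"))  # prints MSDQTG
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content listed in the record.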