Gene Francci3_2209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2209 
Symbol 
ID3906348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2583822 
End bp2584937 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content74% 
IMG OID637879541 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_481307 
Protein GI86740907 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00644027 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAGCGG ATGGGGCCGG CCCGGGGGTG CGGACGTTGC GGCTGGTGAA CCAGGATCAC 
GGGTTTGACT GTCCGGGGTG CGCGTGGCCG GATCCGCCGG TGGGGGAGCG GTCGGTGGCC
GAGTTCGCCT TCACCCCGCC GCGCCGCCAC GGGTTCGATG CGGTGGGGAC GATCCGGGCG
ATGCGCGACG GCCGGGTGAG GGTGTTCCTC GGCATGGGCG GGAACTTCGT GGCCGCGAGC
CCGGACACCG CGGTGACGGA GGCGGCGATG CGGTCGTGCC GGCTGACGGT CCAGGTGTCG
ACGACGCTGA ACCGGTCGCA TGTGGTGACG GGCCGGGCGG CGTTGATCCT GCCGGCGCTG
GGCCGTACGG AGATCGACGT GCAGGCCGCC GGACCGCAGC AGGTCAGCGT CGAGGACTCG
ATGGGGATGG TGCACGCCTC CCGCGGCGGT CTGGCGCCGG CCGGCCCGGG GCTGCGCTCG
GAGGTGGCGA TCGTCTGCGG CGTCGCGGCG GCCACCCTGG CCGGCCAGCC GGAGGTGGCC
GAATCGGGGA CAGCGGACCG GGTGGGGCTC GCCGGGGACT ACCGGCGGAT CCGCGCCCAC
ATCGCCCGGG TCGTCCCCGG GTTCACCGAT TACGAGGCGG GCCTGGCCGA GCTGGGGGGA
TTCCCGCTCC CGCACCCGCC GCGGGACAGC CGGACGTTCC CGACGCCGAG CGGGCGGGCC
GCGCTGACGG TCAACACCTG TGAGGTGCTG CGGGTCCCGC CGGGACACCT GCTGTTGCAG
ACCGTCCGCT CCCACGACCA GTACAACACG ACGATCTACG GCATGGACGA CCGGTACCGC
GGGGTGCGCC GCGGCCGGCG CGTCGTGTTC GTCCACCCGG ACGACCTCGA CGACCTCGGT
ATCGCCGACG GCACCCACGT CGACCTCGTT GGGGTCTGGA CGGACGGGAT GGACCGGCGC
GCGGAGAACT TTCGCGTCGT GGCCTACCCG ACCGCCCGCG GCTGCGCCGC CGCCTACTTC
CCGGAGACCA ACGTCCTGGT CCCCCTCGAC AGCACCGCCG CCCGCAGCAA CACCCCCACC
TCGAAATCCC TGATCATCCG CCTGGAGGCA GGCTGA
 
Protein sequence
MGADGAGPGV RTLRLVNQDH GFDCPGCAWP DPPVGERSVA EFAFTPPRRH GFDAVGTIRA 
MRDGRVRVFL GMGGNFVAAS PDTAVTEAAM RSCRLTVQVS TTLNRSHVVT GRAALILPAL
GRTEIDVQAA GPQQVSVEDS MGMVHASRGG LAPAGPGLRS EVAIVCGVAA ATLAGQPEVA
ESGTADRVGL AGDYRRIRAH IARVVPGFTD YEAGLAELGG FPLPHPPRDS RTFPTPSGRA
ALTVNTCEVL RVPPGHLLLQ TVRSHDQYNT TIYGMDDRYR GVRRGRRVVF VHPDDLDDLG
IADGTHVDLV GVWTDGMDRR AENFRVVAYP TARGCAAAYF PETNVLVPLD STAARSNTPT
SKSLIIRLEA G