Gene Francci3_0087 details

Gene Information

Locus tag: Francci3_0087
Symbol:
ID: 3905131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 106217
End bp: 107347
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 73%
IMG OID: 637877417
Product: dehydrogenases and related proteins-like
Protein accession: YP_479210
Protein GI: 86738810
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
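
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 106217
end_bp = 107347
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
# Assumption: the last codon is a stop codon and does not add a residue.
codons, remainder = divmod(span, 3)
predicted_protein_length = codons - 1

print(span, remainder, predicted_protein_length)  # prints: 1131 0 376
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.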


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.467087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGCACG GACCTGCTCC GGGCTCGGAT GGCGAGCGCT CGGCCTCCGA TCGGTTCTCG 
GCGACAATCG GGTTGACCGG CGACCCTCGC CACCGGACGG GGAGCACTAT CAATGCTGTG
ACGACGCTCC CCGATCTGGT GCCGCCGCGC GATCCCGCCC TTTCTCAGGC CTGGCGGGAG
CTGCGGCGCG TCGTGCGTCG GCGGCAGCCG GACCTGCCCC TCGCGATGGC CCTGGTCACC
GACGGCTCGG ACAGCACTAC CGCCGAGGTG CTGCGCGATG CCGGGGTCGA CGTCGTCGGT
CTCCTCGCCC CGGAGCCGCT GGAGTCCCTG GCCTGGGCCG CCGAGGCGGG TGTCCGACGG
GCGTACGCCG ATCTGCTCAC CTTGCTCTCG GACGACATCG AGGCGGTCTG CGTGGAGATG
CCGCCGCCCG CCTCCGACAT CGTCGCCCGG CAAGCGGCCG AGATCGGCCT GCACGTGCTC
CTGGCCGGGC CGGCGACCGT CGAGGCCGAG GCCCTGCGCG CGGTCGCCGA CCTCGCCGAG
GAGGGTGATC TCGCCCACGT GGTCGCGCTC GACGGGCGGG CCTGGCCGGC TGCCACCCAC
GTCGCGGCGA CCGTTCCCTC CCTGGGCCGG CTCACCCAGA TGACCGTGGT GGGCGCGCCG
AACGGGCCCG CCGGGCGAGT CGAGATCATC GATCTGGCGA TGCGCTGGTG CGGCGAGATC
CTCGCGGTCT GCGCCGATCC GGCCGCGATG CCCGCCCCGG CGCTCACCCC GGACGCACCG
GTGACGCTGG CCCTGCTGGC GGCGAGCGGC GCGACGGTCC TGATCAACGA GCAGATGGGC
GGGAACCTCG GCACAGCCAC CCTCACGGTA TGCGGCGACT CCGGACGCAT CGTCGTCCGG
GGCCGTCTGG TCCGCCGCCA GGACGGCAGC GGCATCCGGG ATCTGATGAT GCCGACCGTA
CCGACCTCCC GGCCGGGGCT GATGGAGGCC ACCTACGACG TGGTGCGTGC CACCGAACTC
GACGACGCCC GGCTGGTCCG CGGCGCCACC TTCCACGACC TGCTCACCGC GAACCACCTG
ATGGCCGCCG CGCAGACCTC CCATCAGCAG GGCGGTTGGG TGGAGCTCTG A
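
The GC content field in this record is a percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied as plain text in the 10-base grouped layout used above; the function name `gc_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Spaces and line breaks (the 10-base grouping used in this record)
    are ignored, and lower-case input is accepted.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input: four of the eight bases are G or C.
print(gc_percent("GGCC AATT"))  # prints: 50.0
```

Passing the full gene sequence to this function should give a value comparable to the GC content listed in the Gene Information section, subject to how that figure was rounded.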
 
Protein sequence
MRHGPAPGSD GERSASDRFS ATIGLTGDPR HRTGSTINAV TTLPDLVPPR DPALSQAWRE 
LRRVVRRRQP DLPLAMALVT DGSDSTTAEV LRDAGVDVVG LLAPEPLESL AWAAEAGVRR
AYADLLTLLS DDIEAVCVEM PPPASDIVAR QAAEIGLHVL LAGPATVEAE ALRAVADLAE
EGDLAHVVAL DGRAWPAATH VAATVPSLGR LTQMTVVGAP NGPAGRVEII DLAMRWCGEI
LAVCADPAAM PAPALTPDAP VTLALLAASG ATVLINEQMG GNLGTATLTV CGDSGRIVVR
GRLVRRQDGS GIRDLMMPTV PTSRPGLMEA TYDVVRATEL DDARLVRGAT FHDLLTANHL
MAAAQTSHQQ GGWVEL
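
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this, assuming the NCBI codon assignments for table 11 (the same amino acid assignments as the standard code) and its listed alternative start codons; `translate_cds` is a hypothetical helper, not the tool that produced this record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# amino acid assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: initiation codons as listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An initiator codon is read as methionine regardless of its usual meaning.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    # Drop a trailing stop codon so the result matches the protein listing.
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGGCACG GACCTGCTCC GGGCTCGGAT"))  # prints: MRHGPAPGSD
```

The output matches the first ten residues of the protein sequence above; without the start-codon rule the first residue would be V.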