Gene Franean1_0588 details


Gene Information

Locus tag: Franean1_0588
Symbol:
ID: 5669005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 680387
End bp: 682402
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 69%
IMG OID: 641239515
Product: molybdopterin oxidoreductase
Protein accession: YP_001504953
Protein GI: 158312445
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
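
The start, end, gene length, and protein length listed above agree with each other. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not fields of the database.

```python
# Values copied from the record above.
start_bp = 680387
end_bp = 682402

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2016 671
```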


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAGA TATCGGTTAC CGACACGAGG ACGGTCCGCA GCTTCTGCCG GATCTGCACG 
TCCGTGTGCG GCATCCTCGT CGAGACGGCT GGTGACAAGG TAGTTCGGGT ACGGGGCGAC
CGTGACCACC CACTGTCGCG GGGATACACC TGTCCGAAGG GCCGGTCACT CCCGCAGATG
CACCATCATC CGGATCGCAT CGAGCGTCCG CTGATGAAGG TCGACGGGGA GCTGCGGCCG
ACGACGTGGG AGGAGTGCCT GGACGATCTC GGCGCCCGGC TGCGAAACAT CATCGAGCGG
TACGGGCCCG AGTCGGTCGG TGTCTTTTTC GGGAGCGGCA TCGGCATGGA CGCCGCCGGT
TACCGGATGG CGCAGGCCCT GCACGCCGCG ATCGGCACGC CGGCGAAGTT CAGTCCCATG
ACCATCGACG GAACGGCCAA GGTGCTGACC GCGGATCTGG TGGGCGGTTC ACCGGCTCTC
AGCGGCCGGC CCGACTACGA CAACGCCTCG TTCGTCCTCT TCGTCGGCAG CAACCCGGTG
GTGTCCCACG GGCACACCGT CGCGATGCCG AACCCCACGG GCACCTTACG GGCGCTGCGG
GAGCGGGCGG AGGTGTGGGT CATCGACCCC CGTCACACCG AGACCGCCCG CCTGGCCGGC
CACCATCTCG CGCCGCGTCC CGGCACCGAC TACGCGGTCC TCGCCTACCT TGTCCGTGAG
ATCCTCCGCG ACGGCGCCGA CCGCGAGATG CTCTCCCGTC ACACCCAGGG TGGCGAGATC
CTGGCTGCCG CCGTTGAGCC GTTCACTCTG GAGCACGCCG CCCGAATCGC CGATGTCTCC
GCCGACGAGC TGGCCGCGCT CCTCGCCGGC GTGCGACGAG CGGGGCGCGT CGCGATCGAA
ACCGGAACCG GCGTCACCAT GGCGTCCAGC GCGAACGTCA CGCAGTGGCT CGCCTGGTCA
CTAATGATTA TCACTGGGTC GATGAACCAG CCCGGCGGCG CATGGTTCCA CCCCGGCTTC
AAAAACCAGC TGGAGGCCTT CAAGCTGCCG ATCTCGCCGC CCGAAGGCTC GTTCGGGCCG
GGCCCGCGCA GCCGTCCGGA GACACAGTCC TTTCTCGGCG AGTGGCCCTG TGCCGTTCTG
GCCGACGAGA TCCGCGCGGG CAACATCCGG GCGGTCCTCA ATCTCGGCGG CCATCTCGTC
GCGGCCTTCC CCGACACCGA GACGCTGGTT CCCGCGCTGC GGGACCTGGA GCTGTTCGCC
ACCATCGAGA TCATCGGCAA CGAGACGACG GCCCTGTCCA CCCACGTCCT GCCGACCAAG
GACCAGCTGG AGCGGGCCGA CGTGAGCCTG TGGGACTTCC TGATACAGCG CGTCGCCGTC
CAGCACACCC CTGCCGTCGT CGAACCGGTC GGGGACCGGC GTTCCGTGTG GTGGGTGCTC
GCGGAACTCG GACAGCGCCT CGGTTACCAG CTCGCCGACA GCAGATCCGG GCAGGTCACC
GACGACACCC TGCTCGCCGA GATCACCGCC CACGCCCGGC GCCCGTTCGG TGAGGTCGTC
TCCGAAGGCT GGGTCGAGGT ACCCCGCGAG ATTCCCGCGC CGTGGGTGGA CGGGCACGTC
GAGCGGATGG GGGGATGGCG CCTCGCTCCC CGGCTGCTCG TCGACCAGCT GGCCGCGCTC
CAGCCTCCCG CCCCGCTCGT CCTCACACCA CGACGCCAGA AGCGCCATCT GAACTCCCAG
TTCGACTACC TCGGAGAACA GCCCGAGATC ATCCTGCATC CCGACGACGC GGCAGCGGCC
GGCGTGGTCG ACAGTCAGCC GGTGACCGTC CGCTCGACCA GCGGCGAGAT CACCGGGATC
GCGAAGGTCG ACGGCACCAT CCGCCGTGGA GCGGTCTCGA TACCCCACGG CCACCAGTCG
GCGAACGTCA ACCGGCTGAC GGACAAGAGC CAGGTCGACA TCGTCACCGG CATGGTCCGC
TACTGCGGCA TCCCGGTGAG CGTCCACCCG GCATAG
 
Protein sequence
MTEISVTDTR TVRSFCRICT SVCGILVETA GDKVVRVRGD RDHPLSRGYT CPKGRSLPQM 
HHHPDRIERP LMKVDGELRP TTWEECLDDL GARLRNIIER YGPESVGVFF GSGIGMDAAG
YRMAQALHAA IGTPAKFSPM TIDGTAKVLT ADLVGGSPAL SGRPDYDNAS FVLFVGSNPV
VSHGHTVAMP NPTGTLRALR ERAEVWVIDP RHTETARLAG HHLAPRPGTD YAVLAYLVRE
ILRDGADREM LSRHTQGGEI LAAAVEPFTL EHAARIADVS ADELAALLAG VRRAGRVAIE
TGTGVTMASS ANVTQWLAWS LMIITGSMNQ PGGAWFHPGF KNQLEAFKLP ISPPEGSFGP
GPRSRPETQS FLGEWPCAVL ADEIRAGNIR AVLNLGGHLV AAFPDTETLV PALRDLELFA
TIEIIGNETT ALSTHVLPTK DQLERADVSL WDFLIQRVAV QHTPAVVEPV GDRRSVWWVL
AELGQRLGYQ LADSRSGQVT DDTLLAEITA HARRPFGEVV SEGWVEVPRE IPAPWVDGHV
ERMGGWRLAP RLLVDQLAAL QPPAPLVLTP RRQKRHLNSQ FDYLGEQPEI ILHPDDAAAA
GVVDSQPVTV RSTSGEITGI AKVDGTIRRG AVSIPHGHQS ANVNRLTDKS QVDIVTGMVR
YCGIPVSVHP A
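
Both sequences above are printed in space-separated blocks of ten characters. A minimal sketch for removing that spacing and estimating the GC fraction of a DNA string is given below. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tool. Only the first line of the gene sequence is used here as a short example, so the printed fraction describes that line alone, not the whole gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record, used as a small example.
first_line = "ATGACGGAGA TATCGGTTAC CGACACGAGG ACGGTCCGCA GCTTCTGCCG GATCTGCACG "
dna = clean_sequence(first_line)

# Six blocks of ten bases give 60 bases once the spacing is removed.
print(len(dna), round(gc_fraction(dna), 2))
```

To check the full gene, paste all of the gene sequence lines into one string and pass it through the same two functions.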