Gene Franean1_2453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2453 
Symbol 
ID5670849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2915155 
End bp2917458 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content73% 
IMG OID641241370 
Productoxidoreductase alpha (molybdopterin) subunit 
Protein accessionYP_001506791 
Protein GI158314283 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01701] oxidoreductase alpha (molybdopterin) subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.786176 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCCG AGCTGAGGAT CGGCGAACCG GCGACGTCCG CCGCCGGTGT TCCCGGCGTG 
GCACACGCGC TGGCGCGCTG CCGGGAGCTG ATGGGCGTGC GCCGTTCCGT GCAGACCCTG
CGGCTACTCA ACCAGGACGA CGGTTACGAC TGTCCCGGCT GCGCCTGGCC CGACCCGCGG
CCGGACGACC GTTCACGCGC CGAATTCTGC GAGAACGGCG CGAAGGCCGT CGCGCAGGAG
GCCACCCGGG CCCGGGTGAC ACCGGAGTTC TTCGCCGCGC ACAGCCTGAC CGACCTCGCC
ACCCGCTCCG GGCACTGGCT CGGTGAACAG GGCCGGCTCA TCAACCCCGT GTACAAGGCC
GCGGGCAGCG ACCATTACGC CCCCGTCGGC TGGGACGAGG CCTTCCGGAT CGTCGCCGAC
GAGCTGACCG GCCTGGAAAG CCCCGCCGAG GCCGCCTTCT ACACCTCGGG GCGGACGAGC
AACGAGGCCG CTTTCCTTTA CCAGCTCTTC GCCCGCGCGT TCGGCACGAA CAATCTGCCG
GACTGCTCGA ACATGTGCCA CGAGTCGTCG GGTGCCGCGC TGACCGAGAC GATCGGGGTC
GGCAAGGGCT CCGTCACCCT GGAGGACCTG GAGACGGCGG ACTTGGTGCT CGTCGTCGGG
CAGAATCCGG GCACCAACCA TCCGCGGATG CTGACGTCGC TGGAGCGGCT GAAGCGGGCC
GGTGGCAGCG TCGTGGCGGT CAACCCGCTA CCCGAGGCGG CGCTGATGCG GTTCCGTAAC
CCGCAGCGGG TCTCCGGGGT GCTCGGCCGG GGCACCGCGC TCGCCGACCA GTTCCTGCAG
ATCCGCCTCG GTGGGGACAT GGCCCTGTTC CAGGCGCTGT CCGCCCGACT GCTCGCTGCC
GAGGAGGCCA GCCCCGGCAC CGTCCTCGAT CAGGCCTTCA TCGCCGGGCA CACGACCGGG
TTCGAGGAGT TCGCGGCGCA CGTACGCGCC CACCTGACCC CCGGCGACGT CGCCACCGCC
ACCGGTCTAC GCCCAGATGA GATCGACGAG CTCGCCGGGC GGGTCCTCGC CGCCGACAAG
GTGATCGTCT GCTGGGCGAT GGGGCTCACC CAGCACCCGG ACGCCGTCGC CACGATCCGG
GAGGTGGTGA ACTTCCTGCT GCTGCGGGGC AACATCGGCC GTCCCGGCGC GGGGGTCTGC
CCGGTGCGCG GCCACTCCAA CGTCCAGGGC GACCGGACGA TGGGCATCTG GGAACGGATG
CCCGACGCGT TCCTGGACGC CCTCGGCACC GAGTTCGGCT TCGCGCCGCC CCGCCACCAC
GGCCTGGACG TCGTCGACAC CATCCGGGCC ATGCGCGACG GCCGGGTGAA GGTGTTCGTC
GCGATGGGCG GCAACTTCGT CGCCGCGACG CCGGACTCGG CCGTCACCGA GGACGCGATC
CGCCGGTGCC GGCTCACCGT GCAGGTCTCC ACCGCGCTCA ACCGCTCGCA CGCCATCACC
GGCGAGCGCG CCCTCATCCT GCCCACACTC GGCCGCACCG AGCTGGACGT CCAGGCCGGC
GGCCCGCAGC GGGTCAGCGT CGAGGACTCG ATGGGCTCCG TCCACGCCTC CCGCGGCCGG
TTGGCCCCGG CCGGGCCCGA GCTGCGCTCC GAGATCGCGA TCATCTGCGG CCTGGCCGCG
GCGACCCTCG ACCGCACGGC CACGGCCACG GCCACGGCCG CCGCGCCGAC GGTCGACTGG
GCGGCGCTGG CCGAGGACTA CCGGCGGGTA CGCGCGCACA TCGCGAACGT GGTCCCCGGG
TTCGCCGACT ACGAGGCCCG CCTCGACGAG CCCGGCGGCT TCCTGCTCCC GCACCCGCCC
CGCGACAGCC GGACCTTCCC GACGCCGAGC GGGCGGGCCG CCTTCACCGT GAACACCTGC
GAGATCCGGC CGACGCCCCC CAGCCACCTG CTCCTGCAGA CGATCCGCTC CCATGATCAG
TACAACACCA CTGTCTACGG TCTGGACGAC CGTTACCGCG GGGTCCGCCA CGGCCGCCGG
GTCGTGCTGG TCAGCCCGGA CGACCTGGCC GAGCTCGGCA TCGCCGACGG CGCCCGGGTC
GACCTGATCG GGGTCTGGAC CGACGGCGTG CAGCGCCGCG CCCCCGACTT CCGGGTGGTC
TCGTATCCCA CCGCCCGCGG CTGCGCCGCC GCCTACTTCC CGGAGACCAA CGTCCTGGTG
CCCCTCGACA GCACCGCGAA GCGCAGCAAC ACCCCGACGT CGAAGTCCAT CCTGATCCGG
CTGGACGTCC ACCCGGAACA GTGA
 
Protein sequence
MDPELRIGEP ATSAAGVPGV AHALARCREL MGVRRSVQTL RLLNQDDGYD CPGCAWPDPR 
PDDRSRAEFC ENGAKAVAQE ATRARVTPEF FAAHSLTDLA TRSGHWLGEQ GRLINPVYKA
AGSDHYAPVG WDEAFRIVAD ELTGLESPAE AAFYTSGRTS NEAAFLYQLF ARAFGTNNLP
DCSNMCHESS GAALTETIGV GKGSVTLEDL ETADLVLVVG QNPGTNHPRM LTSLERLKRA
GGSVVAVNPL PEAALMRFRN PQRVSGVLGR GTALADQFLQ IRLGGDMALF QALSARLLAA
EEASPGTVLD QAFIAGHTTG FEEFAAHVRA HLTPGDVATA TGLRPDEIDE LAGRVLAADK
VIVCWAMGLT QHPDAVATIR EVVNFLLLRG NIGRPGAGVC PVRGHSNVQG DRTMGIWERM
PDAFLDALGT EFGFAPPRHH GLDVVDTIRA MRDGRVKVFV AMGGNFVAAT PDSAVTEDAI
RRCRLTVQVS TALNRSHAIT GERALILPTL GRTELDVQAG GPQRVSVEDS MGSVHASRGR
LAPAGPELRS EIAIICGLAA ATLDRTATAT ATAAAPTVDW AALAEDYRRV RAHIANVVPG
FADYEARLDE PGGFLLPHPP RDSRTFPTPS GRAAFTVNTC EIRPTPPSHL LLQTIRSHDQ
YNTTVYGLDD RYRGVRHGRR VVLVSPDDLA ELGIADGARV DLIGVWTDGV QRRAPDFRVV
SYPTARGCAA AYFPETNVLV PLDSTAKRSN TPTSKSILIR LDVHPEQ