Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5802 |
Symbol | |
ID | 5674125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7045005 |
End bp | 7047311 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244652 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001510054 |
Protein GI | 158317546 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.203441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAGA GAACAAGCGC GGGTACGGGT ACGGGAACGG CCGCCGGCTC GGTGACTGCC CTGCGGGTCT GCCCGCTGTG CGAGGCGACC TGTGGACTGG AGCTACGGAT CACGGACGGC CGGGTGACGG CCGCCCGTGG CGACGCGGAA CATGTGTTCA GCCGTGGCTA CCTCTGCGCC AAAGGAGCCT CCCTGCCGCA GTTACTCGGC GACCCCGACC GGCTTCGCCG TCCGCTGCTG CGCGACCGGG CCACCGGTGA GCACCGTGAG GTCGACTGGG AGAAGGCGTT CGCAGCCGTC CACGCCGGGC TGCGGCGGGT GGTCTCCGCC CATGGCCCGG CCGTGGTCGC GGCCTATCTC GGCAATCCGA ACGCGCACTC GATGGCCGGC GCCCAGCACA GGCCGGCGTT CACCCAGGCG CTGCGGACCC GCGCCGTGTT CAGCGCCTCC ACGGTCGACC AGATGCCCAT GCACGTCGCA TGCGGCCTCG TCTTCGGTCA CCCCTCGCTC ATCCCGGTGC CCGATCTGGA CCGCACCGAC CACCTGCTGA TGCTCGGTGC CAATCCGGCG GTGTCCAACG GCAGTCTGTG CACGGCGCCG GACTTCCCCG GCCGGCTGGC TGGAATCCGC GCGCGTGGCG GGCGGGTCGT CGTCGTCGAC CCACGCCGCA CCCGCACGGC CGCCCTCGCC GACGAGCACC TGCCGATCCG GCCCGGGACG GACGCGCTGT GGCTGTTCGC GATCGTCAAC GTCCTCGCCG CCGAGGGGCT CGTCCGCCTC GGCCCGCTCG CCGCGCACCT GTCCGGAGCG GACAGCGTCG CCGAGCTTGC CGCGCCGTTC ACCCCGGAGC GGGTCGCCGC GAGCTGCGGA ATCCCGGCCG AGACGACCCG GCGGACTGCT CGCGAGCTCG CCGCCGCGCC GCGAGCCGCC GTCTACGGGC GGATGGGCAC GACCACGGTC GAGTTCGGCA CCCTGACCAG CTGGCTGACG ATCGCCCTCA ACGCGATCAC CGGCAATCTC GACGTCCCGG GCGGCGCCAT GTTCGGCCGG GGTGCGCACG GCCGCGCCGA CCGCGTGGGC GGCGGGCGGG GCGGCGGGCG CGGGTGGCGC ACCGGGCGCT GGCACACCCG CGTCCGCGGT CTGCCGGAGG TGATGGGCGA GCTGCCGGCC GCGGCGCTGG CCGAGGAGAT GGACACCCCC GGAGACGGCC GCCTACGGGC GCTGTTCACC ATCGCGGGCA ACCCGGCGCT CTCCACACCG GACTCCGCCA GGCTCGCCGC GGCACTGGCC CAGCTGGACT TCATGGTGAG CGTCGATCCG TACCTCAACG AGACGTCCCG GCACGCCGAC GTGGTGCTGC CGCCGGCTGA TCCGGCCGGG GTCGGGCACT ACGACTTCGC GCTCGGCGCG CTGGCGGTGC GCACCGTGGC GACCTATTCG CCGCCGGCGC TGCCGCCGTA CCCGGGCGGG ATGGCCGAGC ACGACATCCT GGCCCGCCTC ACCCTCATCG CCCTGGCTCT CGCCGACGAG CCGGCCAACG CTGCCACCAC AGCCACCGCC GGGCCGGCCA CCACCGCTGA ATCAGGCACC ACCGCCGAGC CGGCCGATGT CGACGCGACA GCCGACGCGG AGCTGCCGGT CGAGGACTCC CGGCCAGCCG ATGCCGCGGT CTGGACCGAC ACCGCCGTCG CGGCCTTCCA CGAGTACCTG ATCGAGGAGG CACTGCGGCG GGCGGTCACG GAGCCCGGCT CACCTGTCGC CGGGCGGGAC GTCGCCGAAC TGGGCGCTCT CGTGGACGGC GACGGCGCAC CGGAACGCCT ACTCGACATC GCGCTTCGCA CCGGGCACTT CGGTGACGCG TTCGGTGCGC GGCCCGGCGG GCTGAGCCTG GCCACGCTGC GGGCGAACCC GCACGGGATC GATCTGGGAC CGCTCGAGCC GCGGATCCCG GCAATGCTGC GGACGGCCAG CGCGACGGTG GAGCTGTGCC CCGACCCGAT CGTGGCCGAC GCCGCCCGCC TGCACGCCGC CCTCGACGCC CACGAGGCGG CTGAGGGCCG CGAGCCCTCC ATGACCCCGG GTTCCCCAGA GGCACCGGAG AGCCCGGGCA GCCCGGGACT GACCCTGATT GGCCGGCGTC ACCTGCGGAC GAACAACAGC TGGCTGCACA ACGTCCCGGA GATGGCCCGG GGACGAGATC GCTGCACCCT GCTGGTGCAC CCCGACGACG CCGCCCGCCA CGGCGTCCGC GACGGCGGGT CAGCCCGGAT CACGTCCACC GCCGGCTCCC TCGACGTCCG GGTGTAG
|
Protein sequence | MMKRTSAGTG TGTAAGSVTA LRVCPLCEAT CGLELRITDG RVTAARGDAE HVFSRGYLCA KGASLPQLLG DPDRLRRPLL RDRATGEHRE VDWEKAFAAV HAGLRRVVSA HGPAVVAAYL GNPNAHSMAG AQHRPAFTQA LRTRAVFSAS TVDQMPMHVA CGLVFGHPSL IPVPDLDRTD HLLMLGANPA VSNGSLCTAP DFPGRLAGIR ARGGRVVVVD PRRTRTAALA DEHLPIRPGT DALWLFAIVN VLAAEGLVRL GPLAAHLSGA DSVAELAAPF TPERVAASCG IPAETTRRTA RELAAAPRAA VYGRMGTTTV EFGTLTSWLT IALNAITGNL DVPGGAMFGR GAHGRADRVG GGRGGGRGWR TGRWHTRVRG LPEVMGELPA AALAEEMDTP GDGRLRALFT IAGNPALSTP DSARLAAALA QLDFMVSVDP YLNETSRHAD VVLPPADPAG VGHYDFALGA LAVRTVATYS PPALPPYPGG MAEHDILARL TLIALALADE PANAATTATA GPATTAESGT TAEPADVDAT ADAELPVEDS RPADAAVWTD TAVAAFHEYL IEEALRRAVT EPGSPVAGRD VAELGALVDG DGAPERLLDI ALRTGHFGDA FGARPGGLSL ATLRANPHGI DLGPLEPRIP AMLRTASATV ELCPDPIVAD AARLHAALDA HEAAEGREPS MTPGSPEAPE SPGSPGLTLI GRRHLRTNNS WLHNVPEMAR GRDRCTLLVH PDDAARHGVR DGGSARITST AGSLDVRV
|
| |