Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1765 |
Symbol | |
ID | 5670167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2116816 |
End bp | 2118621 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240686 |
Product | NAD+ synthetase |
Protein accession | YP_001506109 |
Protein GI | 158313601 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG0171] NAD synthase [COG0388] Predicted amidohydrolase |
TIGRFAM ID | [TIGR00552] NAD+ synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0678882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000901061 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCAGT TGCGGATCGC CCTCGCCCAG GTGGACACCA CCGTCGGAGA CCTGGACGGC AACGCGGAGC TGGTCAGCTC CTGGACCAAG CAGGCACTCG CCCGCAGCGC GCACCTGGTC GTGTTCGGCG AGATGACGCT GACCGGCTAC CCGGCCGAGG ATCTCGTCCT GCGCCGCTCC TTCGTGGCCG CCTCCGCCGC CGCCCTCGAG CGCCTGGCGG TCCGGCTCGC CGAGGAGGGC GCCGGCGAGA TCGCCGTCGT CGTCGGCTAT CTCGACGCCT CCCCGACCCC GGCGCCGGCG GTCGGGCGTC CGGCCGGGGA GCCGCAGAAC TGCGTCGCCG TGCTGTGGCA GGGCAGGGTC GCGGCCCGCT CGGCCAAGCA CCACCTGCCC AACTACGGGG TGTTCGACGA GTTCCGCTAC TTCGTGCCGG GCACCACGTT CCCGGTGTTC CGGCTGCACG GCGTCGACGT CGGGCTGACC GTCTGCGAGG ACCTCTGGCA GGAGGGCGGC CCGGTCGCGG TGGCCCGCGA GGTCGGGGTC GGGCTGCTGC TGTGCATCAA CGGGTCACCG TACGAGCAGG GCAAGTCCTA CCACCGGGAC GAGCTGTGCG CGCGCCGCGC CCGCGAGGCC GGCGCGACGC TGGCATACGT CAACCTGGTC GGCGGCCAGG ACGAGCTGGT CTTCGACGGC GACTCGCTGG TCGTCGACGC CGACGGCGAG GTCCTCGCCC GGGCGCCGGT CTTCGCCGAG ACCCTGCTCA CCGTCGATCT CGACCTGCCC GAAGGCGCCG GCACCGGCGT GGGCCCGGAG ACGGGCCCGG ACGGGCCGGT GGACGCCGGC GACGGCACCA CGATGGCCGT CAGCCGCGTC GTGCTCGCCG CCGATCCGCT GCCGGCCTGG GAGCCGCGCC CGGCGACGGT CGCCGACCGG CCCGACCCCG CCGGCGAGCT GTACGCCGCC GTCGTCACCG GGACCCGCGA CTATGTCCGC AAGAACGGCT TCCGCTCGGT GGCGCTCGGG CTCTCCGGCG GCATCGACTC GGCCCTGGTG GCCACGATCG CGGTGGACGC CCTGGGCCCG GACGCCGTGC ACACCGTGGC CATGCCGTCG GCGCACTCGT CCCAGGGCTC GCTGGACGAC GCCGCCGAGC TCGCCCACCG GCAGCGGACC CGGCACAGCG TCGTCCCCAT CGAGCCGACC GTGGCCGCCT TCCACGCCGC GCTCGCGGCC GCGGGGGGCC TGCACGGGCT GGCCGCGGAG AACCTGCAGG CGCGGGTCCG TGGGACGCTG CTGATGGCGC TGTCGAACGA GCACGGCCAC CTGGTTCTGA CGACGGGGAA CAAGAGCGAG CTGGCCACCG GCTTCTCCAC CCTCTACGGC GACAGCGCCG GCGGGTTCGC CCCGATCAAG GACGTCTCGA AGACCAACGT GTGGGTGCTG GCCCGCTGGC GCAACGCGCG GGCGGTCGCC CGCGGCGAGG TCCCCCCCAT CCCGGAGGAG ATCATCGTCA AGCCGCCGTC GGCGGAACTC GCCCCCGGCC AGCTCGACTC CGACCGCCTC CCGGACTATT CGGTCCTCGA CGCCGTGCTC GACGACTACG TCAGCCACGA CCTCGGCCGG GCCGAGCTGA CGGCGGCGGG CCACGACCCG GCGACGGTCG ACCGGGTGAT CCGCCTGGTC GACCTCGCCG AGTACAAGCG CCGGCAGAAC CCGCCCGGGC CGAAGGTGAC GTCCAAGGCG TTCGGCCGCG ACCGCCGGTT GCCGATCACC TCTCGCTGGC GCGAGCGCGA CCACTCCGCG GGCTGA
|
Protein sequence | MAQLRIALAQ VDTTVGDLDG NAELVSSWTK QALARSAHLV VFGEMTLTGY PAEDLVLRRS FVAASAAALE RLAVRLAEEG AGEIAVVVGY LDASPTPAPA VGRPAGEPQN CVAVLWQGRV AARSAKHHLP NYGVFDEFRY FVPGTTFPVF RLHGVDVGLT VCEDLWQEGG PVAVAREVGV GLLLCINGSP YEQGKSYHRD ELCARRAREA GATLAYVNLV GGQDELVFDG DSLVVDADGE VLARAPVFAE TLLTVDLDLP EGAGTGVGPE TGPDGPVDAG DGTTMAVSRV VLAADPLPAW EPRPATVADR PDPAGELYAA VVTGTRDYVR KNGFRSVALG LSGGIDSALV ATIAVDALGP DAVHTVAMPS AHSSQGSLDD AAELAHRQRT RHSVVPIEPT VAAFHAALAA AGGLHGLAAE NLQARVRGTL LMALSNEHGH LVLTTGNKSE LATGFSTLYG DSAGGFAPIK DVSKTNVWVL ARWRNARAVA RGEVPPIPEE IIVKPPSAEL APGQLDSDRL PDYSVLDAVL DDYVSHDLGR AELTAAGHDP ATVDRVIRLV DLAEYKRRQN PPGPKVTSKA FGRDRRLPIT SRWRERDHSA G
|
| |