Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4220 |
Symbol | |
ID | 9248094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5038442 |
End bp | 5040145 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_003682118 |
Protein GI | 297563144 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGA CCGAACCCCT CCCCGCCGCA ACCGGTGACA CCCGGCTGAC GGGTATCCCC GCCAGCCCGG GAGCCGCAGT CGGCCCGGTG GCGCGGATGG CCCCGCCGCC GTCCCTGCCC GACCAGCGGC CCACCGTCGT GGACCCGGGA GAGGAGGCCG CCCGGGCACG GCAGGCCCTG GAACAGGTGG CCGAGCTGCT CGAAAGCCAG GCCGCCGACC TCCAGGGCGA CGCGGCGGCC GTGCTGAGCG CCCAGGCGCT CATGGCCCGC GACCCCGAAC TCGCGCGCCG CGCCGCGGAG CGCACCGATG CCGGGGACCC CGCGGCCTGG GCCGTCCACG CGGCCTGCGG CGCCTACCAG GAGCTGCTCC AGGCGGCGGG CGGCTACCTC GCCGAACGCG CCGCCGACCT CGCCGACATC CGGGACCGGG CGACCGCCGT GCTCCTGGAG CTGCCGATGC CCGGGCTGCC CGACCCCGGC GTCCCGCACG TGCTGGTCGC CGACGACCTC GCGCCCGCCG ACACCGTGCA GCTCGACGTC GAGCGCGTGC TCGCCATCGT CACGCGTTAC GGCGGACCCA CCAGCCACAC GGTCATCCTG GCCCGGTCGC TCGGCATCCC GGCCGTGGTC GGCTGCGCGG GGGCCGCCGA ACTGCTCGAC GCGACGCCGG TGGCGGTGGA CGGCGACGCC GGAACGGTCG AGGTCGAGCC CGACGAGGAG CTCGCCGAGC GGATGCGCGA GCGCACCGAG CGGCGGCGCG CCCTGCTGGC CGAGGCCACC GGTCCGGGCC GGACCGCCGA CGACGTTCCG GTCTCGCTGC TGCTCAACGT CGGTGAGGGC GACCCGCAGA AGGCCGGTGC CCACGACAGC GAGGGCGTGG GGCTGCTGCG CACCGAGTTC CTCTTCCTGG AGCGCTCGGA CCCGCCCACC GTCGAGGAGC AGACCGCCTC CTACACGGCC CTGTTCGAGG CGTTCGCCGG GCGCACGGTC ACGGTGCGGA CCCTGGACGC CGGGTCGGAC AAGCCCCTGG CCTTCGCGTC CACCGGCCAC GAGGACAACC CCGCGCTCGG CGTACGGGGT TTCCGCACCT CGCGGGTGCA CCCCCACCTG ATCACCGACC AGCTCCAGGC CATCGCCGCC GCCACCCGGG AGACCTCCGC CAAGGTCCGG GTCATGGCGC CGATGGTGTC CACGCCCGAC GAGGCGGAGG AGTTCGTCTC CCTCGCCCGC GAGGCCGGGA TCGACCAGGT CGGCGTGATG ATCGAGGTGC CCGGCGCGGC GCTGCTCGCC GACCGGCTGC TGTCCCGGGT GGACTTCGTG TCGATCGGCA CCAACGACCT GGCGCAGTAC ACCATGGCAA CCGACCGGAC CCTCGGCGCG CTGCCCGACC TGCTCGACCC CTGGCAGCCC GCGCTGCTCC AGCTGGTGGG CACCGTGGGC GGCGCGGGGG AGCGCTCGGG ACGCCCGGTG GGCGTGTGCG GCGAGGCCGC CGCCGACCCG CTGCTGGCCC TGGTGCTGGT GGGACTGGGC GCGACCAGCC TGTCGATGTC CGCGCCCGCG CTGCCCGCCG TGCGCCACGC GCTGGCCCTG CACACCCACA GTGACTGCCG CAGACTGGCC GACCTGGCGG TCGCGGCCTC CAGCGCCGAA CAGGCGCGCG AGGCGGTCAG GGCCGCCGCC CACCCGGAGG CGGTCGACCG CTGA
|
Protein sequence | MAKTEPLPAA TGDTRLTGIP ASPGAAVGPV ARMAPPPSLP DQRPTVVDPG EEAARARQAL EQVAELLESQ AADLQGDAAA VLSAQALMAR DPELARRAAE RTDAGDPAAW AVHAACGAYQ ELLQAAGGYL AERAADLADI RDRATAVLLE LPMPGLPDPG VPHVLVADDL APADTVQLDV ERVLAIVTRY GGPTSHTVIL ARSLGIPAVV GCAGAAELLD ATPVAVDGDA GTVEVEPDEE LAERMRERTE RRRALLAEAT GPGRTADDVP VSLLLNVGEG DPQKAGAHDS EGVGLLRTEF LFLERSDPPT VEEQTASYTA LFEAFAGRTV TVRTLDAGSD KPLAFASTGH EDNPALGVRG FRTSRVHPHL ITDQLQAIAA ATRETSAKVR VMAPMVSTPD EAEEFVSLAR EAGIDQVGVM IEVPGAALLA DRLLSRVDFV SIGTNDLAQY TMATDRTLGA LPDLLDPWQP ALLQLVGTVG GAGERSGRPV GVCGEAAADP LLALVLVGLG ATSLSMSAPA LPAVRHALAL HTHSDCRRLA DLAVAASSAE QAREAVRAAA HPEAVDR
|
| |