Gene Ndas_4220 details


Gene Information

Locus tag: Ndas_4220
Symbol:
ID: 9248094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5038442
End bp: 5040145
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_003682118
Protein GI: 297563144
COG category:
COG ID:
TIGRFAM ID:
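
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon. The record does not state either assumption, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 5038442
END_BP = 5040145
GENE_LENGTH_BP = 1704
PROTEIN_LENGTH_AA = 567


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Both checks agree with the listed Gene Length and Protein Length.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5040145 - 5038442 + 1 gives 1704 bp, and 1704 / 3 - 1 gives 567 aa, matching the listed values.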


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAGA CCGAACCCCT CCCCGCCGCA ACCGGTGACA CCCGGCTGAC GGGTATCCCC 
GCCAGCCCGG GAGCCGCAGT CGGCCCGGTG GCGCGGATGG CCCCGCCGCC GTCCCTGCCC
GACCAGCGGC CCACCGTCGT GGACCCGGGA GAGGAGGCCG CCCGGGCACG GCAGGCCCTG
GAACAGGTGG CCGAGCTGCT CGAAAGCCAG GCCGCCGACC TCCAGGGCGA CGCGGCGGCC
GTGCTGAGCG CCCAGGCGCT CATGGCCCGC GACCCCGAAC TCGCGCGCCG CGCCGCGGAG
CGCACCGATG CCGGGGACCC CGCGGCCTGG GCCGTCCACG CGGCCTGCGG CGCCTACCAG
GAGCTGCTCC AGGCGGCGGG CGGCTACCTC GCCGAACGCG CCGCCGACCT CGCCGACATC
CGGGACCGGG CGACCGCCGT GCTCCTGGAG CTGCCGATGC CCGGGCTGCC CGACCCCGGC
GTCCCGCACG TGCTGGTCGC CGACGACCTC GCGCCCGCCG ACACCGTGCA GCTCGACGTC
GAGCGCGTGC TCGCCATCGT CACGCGTTAC GGCGGACCCA CCAGCCACAC GGTCATCCTG
GCCCGGTCGC TCGGCATCCC GGCCGTGGTC GGCTGCGCGG GGGCCGCCGA ACTGCTCGAC
GCGACGCCGG TGGCGGTGGA CGGCGACGCC GGAACGGTCG AGGTCGAGCC CGACGAGGAG
CTCGCCGAGC GGATGCGCGA GCGCACCGAG CGGCGGCGCG CCCTGCTGGC CGAGGCCACC
GGTCCGGGCC GGACCGCCGA CGACGTTCCG GTCTCGCTGC TGCTCAACGT CGGTGAGGGC
GACCCGCAGA AGGCCGGTGC CCACGACAGC GAGGGCGTGG GGCTGCTGCG CACCGAGTTC
CTCTTCCTGG AGCGCTCGGA CCCGCCCACC GTCGAGGAGC AGACCGCCTC CTACACGGCC
CTGTTCGAGG CGTTCGCCGG GCGCACGGTC ACGGTGCGGA CCCTGGACGC CGGGTCGGAC
AAGCCCCTGG CCTTCGCGTC CACCGGCCAC GAGGACAACC CCGCGCTCGG CGTACGGGGT
TTCCGCACCT CGCGGGTGCA CCCCCACCTG ATCACCGACC AGCTCCAGGC CATCGCCGCC
GCCACCCGGG AGACCTCCGC CAAGGTCCGG GTCATGGCGC CGATGGTGTC CACGCCCGAC
GAGGCGGAGG AGTTCGTCTC CCTCGCCCGC GAGGCCGGGA TCGACCAGGT CGGCGTGATG
ATCGAGGTGC CCGGCGCGGC GCTGCTCGCC GACCGGCTGC TGTCCCGGGT GGACTTCGTG
TCGATCGGCA CCAACGACCT GGCGCAGTAC ACCATGGCAA CCGACCGGAC CCTCGGCGCG
CTGCCCGACC TGCTCGACCC CTGGCAGCCC GCGCTGCTCC AGCTGGTGGG CACCGTGGGC
GGCGCGGGGG AGCGCTCGGG ACGCCCGGTG GGCGTGTGCG GCGAGGCCGC CGCCGACCCG
CTGCTGGCCC TGGTGCTGGT GGGACTGGGC GCGACCAGCC TGTCGATGTC CGCGCCCGCG
CTGCCCGCCG TGCGCCACGC GCTGGCCCTG CACACCCACA GTGACTGCCG CAGACTGGCC
GACCTGGCGG TCGCGGCCTC CAGCGCCGAA CAGGCGCGCG AGGCGGTCAG GGCCGCCGCC
CACCCGGAGG CGGTCGACCG CTGA
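
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any calculation. The following is a minimal sketch of how a GC percentage like the one listed in the record could be computed. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as printed (grouped in tens).
first_row = "ATGGCGAAGA CCGAACCCCT CCCCGCCGCA ACCGGTGACA CCCGGCTGAC GGGTATCCCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the full 1704 bp sequence, pasted in as a single string, would give the whole-gene figure to compare against the GC content field.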
 
Protein sequence
MAKTEPLPAA TGDTRLTGIP ASPGAAVGPV ARMAPPPSLP DQRPTVVDPG EEAARARQAL 
EQVAELLESQ AADLQGDAAA VLSAQALMAR DPELARRAAE RTDAGDPAAW AVHAACGAYQ
ELLQAAGGYL AERAADLADI RDRATAVLLE LPMPGLPDPG VPHVLVADDL APADTVQLDV
ERVLAIVTRY GGPTSHTVIL ARSLGIPAVV GCAGAAELLD ATPVAVDGDA GTVEVEPDEE
LAERMRERTE RRRALLAEAT GPGRTADDVP VSLLLNVGEG DPQKAGAHDS EGVGLLRTEF
LFLERSDPPT VEEQTASYTA LFEAFAGRTV TVRTLDAGSD KPLAFASTGH EDNPALGVRG
FRTSRVHPHL ITDQLQAIAA ATRETSAKVR VMAPMVSTPD EAEEFVSLAR EAGIDQVGVM
IEVPGAALLA DRLLSRVDFV SIGTNDLAQY TMATDRTLGA LPDLLDPWQP ALLQLVGTVG
GAGERSGRPV GVCGEAAADP LLALVLVGLG ATSLSMSAPA LPAVRHALAL HTHSDCRRLA
DLAVAASSAE QAREAVRAAA HPEAVDR
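
The protein sequence can be reproduced from the gene sequence using the listed translation table (11). The sketch below builds the codon table by hand. For amino-acid assignments, table 11 matches the standard genetic code; it differs only in which codons may serve as start codons, and that does not matter here because this gene begins with ATG. The function and variable names are illustrative, and unknown codons are rendered as X by assumption.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(text: str) -> str:
    """Translate a (possibly space-grouped) DNA sequence up to the first stop."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied as printed.
first_row = "ATGGCGAAGA CCGAACCCCT CCCCGCCGCA ACCGGTGACA CCCGGCTGAC GGGTATCCCC"
last_row = "CACCCGGAGG CGGTCGACCG CTGA"
print(translate(first_row))
print(translate(last_row))
```

The two rows translate to the opening (MAKTEPLPAA TGDTRLTGIP) and closing (HPEAVDR) residues of the protein sequence shown above. The final TGA codon is read as the stop.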