Gene Ndas_0164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0164 
Symbol 
ID9243995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp205968 
End bp207206 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content72% 
IMG OID 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003678120 
Protein GI297559146 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTACG TGTCCCTCCC CCATTCCGTC GGGCGCGCGA CTCTCGCCGC CTCCGCCGCC 
GCGCTCCTCG CCTTCGGGCT CGCCGCTCCG GCCTCCGCCG ACCCGGTGGA GACCTACGAG
GGCTCCGTCC GCGCCCAGTA CCCCGCCACC GCCGCGTCCG GCGTGGACGT TCGGATCAAC
GGTGAGATGG AGAGCACGAG TCTCTTCGAC CTCCGGTTGG AGAACGGCAC CGCCCTCACC
GCCTACTGCA TCGACCTTGA GACCAGGATC AAGGACAACG CCTGGTACCT GGAGGACGAC
TGGGCGAACT ACCCGGGCAG GGGCGACTTC GCGGAGCCGG GCAAGGTGCA CTGGATCCTC
CAGAACAGCT ACCCGACCGT CAGCGCCGCC CAGCTCGCCG AGAACGCGGG GCTGAATCGG
GGCAACGCCC GCCACTTCGG TGATGAAGAG GCGATCGCCG CCACCCAGGC CGCCATCTGG
CACTTCAGCA ACGGCGCCGA GGTGACCGCG AACGACCCGA ACGGCGTCAG GGCGGTCTAT
GACTACCTGG TCGGGGAAGC CCAGGACCTC CCGCAGGAGC CCGGTCCGAC CCTGAGCATC
ACCCCGGGCG AGGCCTCCGG CAGCGCGGGC GAGACGATCG GCGAGTTCCT CGTCGAGACC
AGCGACGCGG ACGGCATCGA GGTCAGCGTC CAGGCCCCCG AGGGTGTCGA GGTCGAGCTG
GTCGACCTGG AGACCGGCCA GCCCGTCACC ACGGTCAACA ACGGTGACAC CGTCGGCCTG
GCCGTTCCGG AGGGCGCGGC GGAGGGCACC GCCTCCTTCT CCCTGGAGAC CACGGCCACC
GTGCGGTCCG GCCGCCTGTT CAAGGGCGAG GAGGAGTACC AGCCGACCCA GACCCTGATC
ACCGCCCAGG ACAGCGAGGT CACCGTCTCC GCCTCGGCCT CGGTCTCCTG GACCGGCGGC
GGCGAGACCC CGCCCCCCAC GGAGGAGCCG AGCGAGGAGC CCTCCGAGGA GCCGAGCGAG
CCCGAGAGCC CGGAGCCGAC CCCGAGCGAC GAGCCCTCCG AGCCCGTCGA CAAGCCGTCC
GAGCCCGCCG ACGACCAGAA CGAGCCCAGC CTGCCGGTGA CCGGTGGCGC GCTCGTCGGC
CTGGTCGCCG CCGGTGTGGC CGCGCTCGGC GCGGGCGGCG GCGCCCTCTA CCTGAGCCGC
AGGCGCAAGG CGGCGGGCAG CCAGGACCTG GAGGGCTAG
 
Protein sequence
MTYVSLPHSV GRATLAASAA ALLAFGLAAP ASADPVETYE GSVRAQYPAT AASGVDVRIN 
GEMESTSLFD LRLENGTALT AYCIDLETRI KDNAWYLEDD WANYPGRGDF AEPGKVHWIL
QNSYPTVSAA QLAENAGLNR GNARHFGDEE AIAATQAAIW HFSNGAEVTA NDPNGVRAVY
DYLVGEAQDL PQEPGPTLSI TPGEASGSAG ETIGEFLVET SDADGIEVSV QAPEGVEVEL
VDLETGQPVT TVNNGDTVGL AVPEGAAEGT ASFSLETTAT VRSGRLFKGE EEYQPTQTLI
TAQDSEVTVS ASASVSWTGG GETPPPTEEP SEEPSEEPSE PESPEPTPSD EPSEPVDKPS
EPADDQNEPS LPVTGGALVG LVAAGVAALG AGGGALYLSR RRKAAGSQDL EG