Gene Ndas_4408 details

Gene Information

Locus tag: Ndas_4408
Symbol:
ID: 9248283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5244528
End bp: 5245697
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: glycosyltransferase, MGT family
Protein accession: YP_003682303
Protein GI: 297563329
COG category:
COG ID:
TIGRFAM ID:
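
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5244528
end_bp = 5245697

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1170 389"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.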


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACCGCC GCGCCCACAT CGCCATGGTC GGCACCCCCA CCGTGAGCCA CGTCCTGCCC 
AGCCTGGAGG TCATCCGCGA GCTGGTGGAC CGGGGCCACC GGGTCACCTA CGCCAACGAC
CCCCTGCTGG CCGACACCAT CACCGGCGTC GGGGCCGAAC TCGTCCCCTA CTCCACCATC
CTGCCCACCG GCGACGGCAC CTGGCCGGAC GACCCGGTCA AGGGCATGGA CATCTTCCTG
GACGAGGCGA TCAACCAGCT GCCCGCCCTG CGCGCCGCCT ACGACGGAGA CCGTCCCGAC
CTGTTCCTGT ACGACATCTC GGGGTTCGCC GCCCGGGTCC TCTCCGTCAA CTGGGACATC
CCCTCCGTGC AGCTCTCGCC CACCTACGTG GCCTGGGCCG ACTACGAGGA CACCGTCCTG
AAGTGGCTGC GCGCCCAGCC CGGGGCCGAG GAGCACTACG CCAAGCTCGA CGCCTGGCTC
GCCGACAACG GCGTCACCGG CCTCGACCAC TCCTCCTTCG CCGGGGTGCC GGAGCGGGCG
CTGGCGCTGA TCCCACGCGA GATGCAGCCC TTCGCCGACA CCGTGGCCGA GACGGTGACC
TTCGTGGGGC CGTGCCTGGG TGACCGGGCC GACCAGGGCG AATGGACCCG TCCGGCGGAC
GCCGACAACG TCCTGCTGGT CTCCCTGGGG TCGGCGTTCA CCAACCAGCC GGGGTTCTAC
CGCGCCTGCC TGGAGGCCTT CGGGGACCTG CCCGGCTGGC ACGTGGTGCT CCAGATCGGC
AAGTACGTGG ACCCCGCCGA GCTGGGGGAG GTGCCGGGCA ACGTCGAGGT GCACACGTGG
GTGCCGCAGC TGGCCGTCCT GCGCCAGGCC GACGCCTTCG TCACCCACGC CGGTATGGGC
GGCTCCAGCG AGGGCCTCTA CACGGGGGTG CCGATGATCG CCGTCCCGCA GGCCGTCGAC
CAGTTCGACA ACGCCGACCG ACTGGTGGAA CTCGGCGTCG CCCGGAGGAT CGACACCGGG
GAGGCCAGCG CGGAGCGGCT GCGCTCGGCC CTGCTGGAGC TGACCGCCGA CCCCGGGGTC
GCCCGCCGCC TCGCCGAGGT CAGCGCCCGG CTCCAGGCCA GCGGTACCTC CTACGCGGCC
GACCTGGTCG AGGCGGAACT GCCCGCCTGA
 
Protein sequence
MNRRAHIAMV GTPTVSHVLP SLEVIRELVD RGHRVTYAND PLLADTITGV GAELVPYSTI 
LPTGDGTWPD DPVKGMDIFL DEAINQLPAL RAAYDGDRPD LFLYDISGFA ARVLSVNWDI
PSVQLSPTYV AWADYEDTVL KWLRAQPGAE EHYAKLDAWL ADNGVTGLDH SSFAGVPERA
LALIPREMQP FADTVAETVT FVGPCLGDRA DQGEWTRPAD ADNVLLVSLG SAFTNQPGFY
RACLEAFGDL PGWHVVLQIG KYVDPAELGE VPGNVEVHTW VPQLAVLRQA DAFVTHAGMG
GSSEGLYTGV PMIAVPQAVD QFDNADRLVE LGVARRIDTG EASAERLRSA LLELTADPGV
ARRLAEVSAR LQASGTSYAA DLVEAELPA
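
The gene sequence begins with TTG, while the protein begins with M. This is consistent with translation table 11, where TTG is one of the alternative start codons and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the database's own method. It assumes the input is a complete CDS on the coding strand that begins with a valid start codon, so the first codon is always emitted as M. The names `clean`, `translate_cds`, and `CODON_TABLE` are hypothetical helpers introduced for illustration.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same codon-to-residue mapping for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is
    emitted as M even when it is TTG, GTG, or another alternative start.
    """
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


# First line of the gene sequence above, with its display spacing.
first_line = "TTGAACCGCC GCGCCCACAT CGCCATGGTC GGCACCCCCA CCGTGAGCCA CGTCCTGCCC"
print(translate_cds(first_line))  # prints "MNRRAHIAMVGTPTVSHVLP"
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through the same function would be one way to compare against the complete 389-residue protein.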