Gene Ndas_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1984 
Symbol 
ID9245834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2405150 
End bp2406187 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content76% 
IMG OID 
Productaminotransferase class V 
Protein accessionYP_003679916 
Protein GI297560942 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.817831 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000368323 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGACA CCATCACCGA CCTGTGGAGC ACCGACACCG TCTGGCTCAA CACCGCCCAG 
TACGGCATCC CGCCCCGGCC CGCCCACCAG GCCCTGGCCG ACGCCGTGCG CTCCTGGCAC
ACCGCCACCG GGAACCCCGC GGCCTGGGGG CGCGAACTCG AACAGGCCCG CGTCAACCTC
GCCGCCCTGG TGGGCGCGCC CGCCGACGAC CTCACCCTGG GCGCCAGCAC CGCCCAGATC
GCCGGGACCA TCGCCGCCAG CCTGCCCGAC GGCGCCCGCG TCCTGGTTCC CGAGGGGGAC
TTCGCCTCGA TCGTCTTCCC CTGGCAGGCC CAGGCCGACC GCGGCGTCAC CGTGGAGGCC
GTCCCCCTGG ACCGCCTGGC CCAGGCCGTG GACGCGCGCA CGCACCTGGT CGCCTTCAGC
ACGGTGCACT CGGCGAACGG ACGCCTGGCC CCCACCGGCG ACATCGTCGC GGCCGCCCGC
GCCCACGGCG CCCTGGTCGT GGCCGACGCC ACCCAGGCCG CCGGATGGAC CCCCCTGGAC
GCCACCGTCT TCGACGCCCT GATCGCCTCG GCCTACAAGT GGCTCATGGC GCCGCGCGGC
CTGGCCCTGG CCTACCTGTC CCCCGGCCTG CGCGCGCGGC TGCGCCCCAA CAACGCCGGC
CCGGCCGCCG CCCGCGACAC CGCCTCGGCG ATGTACGCCG CCCGGATGGA CCCGGCGCCG
ACCGCCCGCG CCTTCGACAC CTCGCCCAAC TGGTTCGCGG CCGTGGCGGC CGCAGCCTCC
AGCCGGGTCC TGCTGGAGGC GGGCCTGGAG AGGGTGCGCG CCCACAACAC CGCCCTGACC
GACCACTTCC GCGCGGCGCT GGGGCAGGAG CCCGCGCACT CGGCCATCAC CAGCGTCGAC
CTGCCCGGCG CCTCCGAGCG CCTGGCCCGG GCCGGGGTGG TGACCACCGA GCGCGGGGGC
CGCACCCGCC TGTCCTTCCA CCTGTACAAC ACCCTCGACG ACGCCGAGCG CGCCGCCAAG
GCCCTGCTCC AGCCCTAA
 
Protein sequence
MDDTITDLWS TDTVWLNTAQ YGIPPRPAHQ ALADAVRSWH TATGNPAAWG RELEQARVNL 
AALVGAPADD LTLGASTAQI AGTIAASLPD GARVLVPEGD FASIVFPWQA QADRGVTVEA
VPLDRLAQAV DARTHLVAFS TVHSANGRLA PTGDIVAAAR AHGALVVADA TQAAGWTPLD
ATVFDALIAS AYKWLMAPRG LALAYLSPGL RARLRPNNAG PAAARDTASA MYAARMDPAP
TARAFDTSPN WFAAVAAAAS SRVLLEAGLE RVRAHNTALT DHFRAALGQE PAHSAITSVD
LPGASERLAR AGVVTTERGG RTRLSFHLYN TLDDAERAAK ALLQP