Gene Ndas_3941 details


Gene Information

Locus tag: Ndas_3941
Symbol:
ID: 9247812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4712635
End bp: 4714245
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: carboxyl transferase
Protein accession: YP_003681844
Protein GI: 297562870
COG category:
COG ID:
TIGRFAM ID:
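
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 4712635
END_BP = 4714245
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the reported gene length (1611 bp) and protein length (536 aa) are consistent with the listed coordinates.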


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0385716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACCG AAGCCCCTGA ACCGCTGTCC GCGGACGAGA TCGACATCCA CACGACCGCG 
GGCAAGCTCG CCGACCTGCA ACGACGCCGG TACGAGGCCG TACACGCAGG GTCTGCCCGA
GCCGTTGAGA AACAGCACGC CAAAGGCAAG ATGACCGCCC GCGAGCGGAT CGACGCACTG
CTCGATCCCG GCTCGTTCGT GGAGTTCGAC GCCCTGGCCC GACACCGCTC CACCAGCTTC
GGCCTGGAGA GCAACCGCCC CTACGGTGAC GGCGTCGTCA CCGGCCACGG AACGGTCGAC
GGACGTCCCG TGGCGGTGTT CAGCCAGGAC GTCACCGTCT TCGGCGGATC GCTGGGCGAG
GTCTACGGCG AGAAGATCGT CAAGGTCCTC GACCACGCCC TCACCAACGG GTGCCCCGTC
GTGGGCATCA ACGAGGGCGG CGGCGCGCGC ATCCAGGAGG GCGTGGTCGC GCTCGGCCTG
TACGCCGAGA TCTTCAAGCG CAACACCCAC GCCTCGGGCG TCATCCCGCA GATCTCCCTG
ATCATGGGCG CGGCCGCGGG CGGCCACGTC TACTCCCCCG CCCTCACCGA CTTCGTCGTG
ATGGTGGACG AGACCTCGCA GATGTTCATC ACCGGCCCCG ACGTCATCAA GACCGTCACC
GGCGAGGACG TCTCCATGGA GGAGCTGGGC GGCGCCCGCA CCCACAACAC CAGGTCGGGT
GTGGCGCACT ACATGGGCGC CGACGAGCAG GACGCGATCG AGTACGTGAA GACGCTGCTC
TCGCACCTGC CCGACAACAA CCTGGAGGAG GCGCCGCAGC TGCCCCCCGA GGACGCCCCG
GGCGACGAGG CCACCGACGC CGACCTGGCC CTGGACGCGT TCATCCCGGA CTCGGCCAAC
CAGCCCTACG ACATGCGCAC GGTGGTCGAG GCCGTCCTGG ACGACGGCGA CTTCCTGGAG
GTCCACGCCC AGTTCGCCAC GAACATGGTG GTGGGCTTCG GCCGCGTGGA CGGCCAGTCC
GTCGGCATCG TCGCCAACCA GCCCCTGAGC CTGGCGGGCT GCCTGGACAT CGACGCCTCC
GAGAAGGCCG CGCGCTTCGT GCGCACCTGC GACGCCTTCA ACGTGCCGGT GCTGACCTTC
GTGGACGTGC CCGGGTTCCT GCCCGGCACC GACCAGGAGT GGGACGGCAT CATCCGCCGC
GGCGCCAAGC TGCTGTACGC CTACGCCGAG GCCACGGTGC CCCTGATCAC CGTCATCACG
CGCAAGGCCT TCGGCGGCGC CTACGACGTC ATGGGCTCCA AGCACCTGGG CGCCGACGTC
AACCTGGCCT GGCCCACGGC GCAGATCGCG GTCATGGGCG CCCAGGGCGC GGTGAACATC
CTGCACCGGC GCACGCTGGC CGCCGCCGAC GACGTCGAGG CCGAGCGCAC GCGGCTGGTC
GGCGAGTACG AGGACACCCT CCTCAACCCC TACTCCGCGG CGGAGCGGGG CTACGTGGAC
GGGGTCATCA TGCCCTCCGA GACCCGCGTC CGGATCGCCA AGTCCCTCAA GGCGCTGCGC
AACAAGCGCA AGCAGCTGCC GCCCAAGAAG CACGGGAACA TCCCGCTGTG A
 
Protein sequence
MATEAPEPLS ADEIDIHTTA GKLADLQRRR YEAVHAGSAR AVEKQHAKGK MTARERIDAL 
LDPGSFVEFD ALARHRSTSF GLESNRPYGD GVVTGHGTVD GRPVAVFSQD VTVFGGSLGE
VYGEKIVKVL DHALTNGCPV VGINEGGGAR IQEGVVALGL YAEIFKRNTH ASGVIPQISL
IMGAAAGGHV YSPALTDFVV MVDETSQMFI TGPDVIKTVT GEDVSMEELG GARTHNTRSG
VAHYMGADEQ DAIEYVKTLL SHLPDNNLEE APQLPPEDAP GDEATDADLA LDAFIPDSAN
QPYDMRTVVE AVLDDGDFLE VHAQFATNMV VGFGRVDGQS VGIVANQPLS LAGCLDIDAS
EKAARFVRTC DAFNVPVLTF VDVPGFLPGT DQEWDGIIRR GAKLLYAYAE ATVPLITVIT
RKAFGGAYDV MGSKHLGADV NLAWPTAQIA VMGAQGAVNI LHRRTLAAAD DVEAERTRLV
GEYEDTLLNP YSAAERGYVD GVIMPSETRV RIAKSLKALR NKRKQLPPKK HGNIPL