Gene Ndas_4171 details


Gene Information

Locus tag: Ndas_4171
Symbol:
ID: 9248045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4982234
End bp: 4983364
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: phosphoserine aminotransferase
Protein accession: YP_003682072
Protein GI: 297563098
COG category:
COG ID:
TIGRFAM ID:
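
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so, but that reading reproduces the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length(4982234, 4983364)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Both results agree with the Gene Length and Protein Length fields.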


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.309749
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCGTGA CCGACGAGAT TCAGATCCCC GCGAACCTGC TGCCCTCCGA CGGCCGTTTC 
GGCAGCGGCC CGTCCAAAGT CCGCCCCGCC CAGATCGAGG CCCTGGCCGC CTCCGGCTCC
CGCTACATGG GCACCTCCCA CCGGCAGAAG CCGGTCAAGT CCCTGGTCTC CCGGGTCCGC
TCCGGGGTGA GCGAGCTGTT CTCCCTCCCC GACGGCTACG AGGTGGTCCT CGGCAACGGC
GGCACCACCG CCTTCTGGGA CATCGCCGCG CACGGCCTGC TGCGCGAGAA GTCCCAGCAC
CTCGCCTTCG GCGAGTTCTC CAGCAAGTTC GCGAAGGTCG CCAAGGGCGC GCCCTGGCTC
CAGGAGCCCA CCGTCATCAG CACCGACCCG GGCAGCCACA GCGAGCCCGC GGCCGAGGCC
GGCGTGGACG TGTACGCGCT GACCCACAAC GAGACGTCCA CCGGTGTGGC CGCGCCCATC
AGGCGCGTGG CGGGCGCCGA CGAGGACGCG CTCGTCCTCG TGGACGCCAC CAGCGGCGCG
GGCGGCCTGC CGGTCGACAT CGCCGAGACC GACGTCTACT ACTTCGCGCC GCAGAAGAGC
TTCGCCGCCG ACGGCGGGCT GTGGCTGGCC GTCATGTCGC CCCGGGCCCT GGCCCGCGTG
GAGGAGATCG CGGCGAGCGG CCGCTACGTG CCGGAGTTCT TCTCGCTGCC CACGGCGATC
GACAACTCCC GCAAGGACCA GACCTACAAC ACCCCGGCCG TGGCCACGCT GCTGCTGCTC
GCCGAGCAGC TGGAGTGGAT GAACGGCCAG GGCGGCCTGG AGTGGACCGT GGCGCGCACC
GCCGAGTCCT CCTCGGTCCT CTACGACTGG GCGGAGAAGT CGCCGGTCGC GACGCCGTTC
GTCACCGACC CGTCCAAGCG CTCTCAGGTG GTCGGCACCA TCGACTTCAG CGACGACGTG
GACGCCGCCG CGGTGGCCCG GGTCCTGCGC GCCAACGGCG TGGTCGACAC CGAGCCCTAC
CGCAAGCTGG GCCGCAACCA GCTGCGCGTG GCCATGTTCC CGGCGATCGA CCCGGACGAC
GTGCGCGCGC TCACCGAGTG CGTGGACCAC GTGCTCACCG AGCTGTCCTG A
 
Protein sequence
MVVTDEIQIP ANLLPSDGRF GSGPSKVRPA QIEALAASGS RYMGTSHRQK PVKSLVSRVR 
SGVSELFSLP DGYEVVLGNG GTTAFWDIAA HGLLREKSQH LAFGEFSSKF AKVAKGAPWL
QEPTVISTDP GSHSEPAAEA GVDVYALTHN ETSTGVAAPI RRVAGADEDA LVLVDATSGA
GGLPVDIAET DVYYFAPQKS FAADGGLWLA VMSPRALARV EEIAASGRYV PEFFSLPTAI
DNSRKDQTYN TPAVATLLLL AEQLEWMNGQ GGLEWTVART AESSSVLYDW AEKSPVATPF
VTDPSKRSQV VGTIDFSDDV DAAAVARVLR ANGVVDTEPY RKLGRNQLRV AMFPAIDPDD
VRALTECVDH VLTELS
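
The sequence blocks above are printed in groups of ten characters, so any script has to remove the whitespace first. The sketch below is one possible way to do that, then compute GC content and translate the gene sequence for comparison with the protein sequence. It rests on a few assumptions: the amino-acid assignments of translation table 11 are the same as those of the standard code, only ATG, GTG and TTG are treated as start codons (a simplification of the table 11 start list), and the helper names are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplifying assumption: only these three codons count as starts.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    An initial start codon is read as M. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGTCGTGA CCGACGAGAT TCAGATCCCC"))  # MVVTDEIQIP
```

The printed peptide matches the first ten residues of the protein sequence. Passing the complete gene sequence to `gc_fraction` and `translate_cds` would allow the same comparison against the GC content and full protein sequence listed in the record.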