Gene Ndas_3781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3781 
Symbol 
ID9247650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4543986 
End bp4545161 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content76% 
IMG OID 
Productaminotransferase class V 
Protein accessionYP_003681685 
Protein GI297562711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.559401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.819909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTATC TGGATCACGC TGCCACGACC GACGTGCGCC CCGAGGTCGC CGCCGACGTC 
GCGGCCGAGC TCGGCGCCCT CGGGAACCCG TCGTCCCTGC ACGGCCACGG GCGCGCCGCG
CGCCGCACGG TGGAGGAGGC GCGCGAACGC CTGGCCGGTG CGCTCGGCTC GACCCCGCAC
GAGGTGGTCT TCACCGGGGG CGGCACCGAG TCGGACAACA TCGCCGTCAA GGGCCTGTAC
TGGGCGCGCA ACACGCCCGA CGCACGCAGG CCCAGGATTC TCACCAGTGC CGTCGAGCAC
CACGCCGCCC TCGACCCCGC GCACTGGATG GCCGAGCACC AGGGGGCCGT GCACGAGAAG
ATCCCCGTGG ACGCGCTGGG CCGGGTGGAC CCCGAGGTCC TGCGGGAGGC GATCGAGGGC
GGCTCCGGCC CCGAGGCGGT CGCGCTCGTC TCGGTCATGT GGGCCAACAA CGAGGTGGGC
ACGGTGCAGC CGGTCCGGGA GCTGGCCGCC GTCGCCGCCG AGTACGGCAT CCCCTTCCAC
ACCGACGCCG TCCAGGCCGT GGGCGTGGAA CCGGTCGACT TCGCCGGGAG CGGGGTGAGC
GCGCTGACGG TCAGCGGCCA CAAGCTCGGC GGCCCGGTGG GCGCGGGCGC GCTGCTGGTC
GCGCGCGGCC TGGCCCCGGT GCCGGTCCTG CACGGTGGCG GCCAGGAGCG CGACATCCGC
TCGGGCACCC TGTCCCCGCC GCTGCTGCGC GGACTGGCCA CGGCCGTCCA CCTGGCGGTC
GCCGAACGCG AGGAGCACTC CAAGCACCTG GCGGGCCTGC GGGACGAGCT GGAGGAGCGG
GTCCGCGAGG CGGTGCCCGA CGCGGTCGTC AACGGCGACC CCGCGCGCAG GCTGCCGGGG
ATCTCGCACA TCTCCTTCCC CGGCTGCGAG GGGGACGCGC TGCTCATGCT CCTGGACGCC
AAGGGCATTT CCTGCTCGAC CGGCTCGGCC TGCTCGGCGG GGGTGGCCCA GCCCAGCCAC
GTGCTGCTGG AGATGGGCGC CGACGCCGGG ACCGCGCTGA GCAGCCTGCG TCTGTCGCTG
GGCCGCACCT CCACCCGGGA GGACGTGGCG GCGCTGGCGC GGGCCATCGG ACCGGCCGTG
GAGCGGGCCA GGGCCGCCCG CGCCAGACGC CGTTAG
 
Protein sequence
MVYLDHAATT DVRPEVAADV AAELGALGNP SSLHGHGRAA RRTVEEARER LAGALGSTPH 
EVVFTGGGTE SDNIAVKGLY WARNTPDARR PRILTSAVEH HAALDPAHWM AEHQGAVHEK
IPVDALGRVD PEVLREAIEG GSGPEAVALV SVMWANNEVG TVQPVRELAA VAAEYGIPFH
TDAVQAVGVE PVDFAGSGVS ALTVSGHKLG GPVGAGALLV ARGLAPVPVL HGGGQERDIR
SGTLSPPLLR GLATAVHLAV AEREEHSKHL AGLRDELEER VREAVPDAVV NGDPARRLPG
ISHISFPGCE GDALLMLLDA KGISCSTGSA CSAGVAQPSH VLLEMGADAG TALSSLRLSL
GRTSTREDVA ALARAIGPAV ERARAARARR R