Gene Ndas_5107 details


Gene Information

Locus tag: Ndas_5107
Symbol:
ID: 9248999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 254078
End bp: 255295
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: aminotransferase class I and II
Protein accession: YP_003682994
Protein GI: 297564021
COG category:
COG ID:
TIGRFAM ID:
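
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the protein length excludes the terminal stop codon. Both readings are assumptions inferred from the numbers, not stated by the record. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(254078, 255295)
print(length)                     # 1218
print(protein_length_aa(length))  # 405
```

The same check can flag records whose coordinates and reported lengths disagree, for example when a gene is spliced or a pseudogene.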


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.294065
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.505049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACGCA TGACTGATCG ACCCCGTCTC TCTGCACGTA TCAGCGCGAT CTCCGAGTCC 
GCCACCCTGG CTGTGGACGC CAAGGCCAAG GCGATGAAGG CGGAGGGCCG TGCCGTCATC
GGCTTCGGCG CGGGAGAGCC CGACTTCCCG ACCCCCGACT ACATCGTCGA GGCCGCCGTC
GAGGCCGCCC GTGAGCCCCG GTTCCACCGC TACACGCCCG CCGGCGGCCT GCCCGAGCTC
AAGAAGGCCA TCGCCGAGAA GACCCTGCGC GACTCCGGCT ACGAGGTCGA CCCCGCCCAG
GTCCTGGTGA CCAACGGCGG CAAGCAGGCC ATCTACGAGG CCTTCGCCGC CATGCTGGAC
CCGGGCGACG AGGTCATCGT CATCGCCCCG TACTGGACCA CCTACCCCGA GTCGATCAAG
CTCGCGGGCG GCGTCCCGGT CTTCGTCGTC ACCGACGAGA GCACCGGCTA CCTGGCCAGT
GTGGAGCAGC TGGAGGCGGC GCGCAGCGAG CGCACCAAGG TCCTGGTGTT CGTCTCCCCG
TCCAACCCGA CCGGCGCCGT GTACCCGCGC GAGCAGGTCC GCGAGATCGG CCGCTGGGCG
AACGAGCACG GCCTGTGGGT GCTGTCCGAC GAGATCTACG AGCACCTGGT CTACGGGGAC
GCCGAGTTCT CCTCGCTGCC CGTCGAGGTG CCCGAGATCG CCGACCGCAC CGTCATCGTC
AACGGCGTGG CCAAGACCTA CGCCATGACC GGCTGGCGCG TGGGGTGGAT CATCGGCCCC
AAGGACGTGG TCAAGGCCGC GGGCAACCTC CAGTCGCACG CCACCTCCAA CGTCGCCAAC
GTCTCGCAGG CCGCCGCCCT GGCCGCGGTC TCCGGCGACC TGGACGCCGT GGCGACGATG
CGCGAGGCCT TCGACCGCCG CCGCAAGACG ATCGTGCGCA TGCTCAACGA GATCGACGGC
GTGGTCTGCC CCGAGCCGCA GGGCGCGTTC TACGCCTACC CCTCGGTCAA GGGCGTGCTG
GGCAAGGAGA TCCGGGGCAG GACGCCGCAG ACCTCCACCG AGCTGGCCGA GCTCATCCTG
GAGCAGGCCG AGGTCGCCGT GGTGCCCGGT GAGGCCTTCG GCACCCCCGG CTACCTGCGC
CTGTCCTACG CGCTCAGCGA CGAGGACCTC GCCGAGGGCG TGAGCCGCAT CCAGAAGCTG
CTGGCCGAGG CCAAGTAG
 
Protein sequence
MGRMTDRPRL SARISAISES ATLAVDAKAK AMKAEGRAVI GFGAGEPDFP TPDYIVEAAV 
EAAREPRFHR YTPAGGLPEL KKAIAEKTLR DSGYEVDPAQ VLVTNGGKQA IYEAFAAMLD
PGDEVIVIAP YWTTYPESIK LAGGVPVFVV TDESTGYLAS VEQLEAARSE RTKVLVFVSP
SNPTGAVYPR EQVREIGRWA NEHGLWVLSD EIYEHLVYGD AEFSSLPVEV PEIADRTVIV
NGVAKTYAMT GWRVGWIIGP KDVVKAAGNL QSHATSNVAN VSQAAALAAV SGDLDAVATM
REAFDRRRKT IVRMLNEIDG VVCPEPQGAF YAYPSVKGVL GKEIRGRTPQ TSTELAELIL
EQAEVAVVPG EAFGTPGYLR LSYALSDEDL AEGVSRIQKL LAEAK
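
The sequences above are printed in 10-character blocks separated by spaces, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the record. The helper names are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGGACGCA TGACTGATCG ACCCCGTCTC TCTGCACGTA TCAGCGCGAT CTCCGAGTCC"
seq = join_blocks(first_line)

print(len(seq))               # 60
print(seq.startswith("ATG"))  # True: the CDS begins with an ATG start codon
print(round(gc_content(seq), 1))
```

Passing the full gene sequence to `join_blocks` and then `gc_content` gives the figure to compare against the GC content field.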