Gene Namu_5131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5131 
Symbol 
ID8450762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5718700 
End bp5719890 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content72% 
IMG OID645044165 
Productaminotransferase class I and II 
Protein accessionYP_003204389 
Protein GI258655233 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGAT CGGGCCGTCG CGGCGACGAG ATCATCGGCA AAGCCGCCGG TCTCGGGCCG 
TCCCACCGGT CCCGGGTGGC GCCGTTCTAC GTGATGCAGG TGCTCGCCGC GGCGGCCCGT
CGGCGGGCCG CCGGCGAGGT GGTCTGGGAC CTGGCCGCCG GGCAGCCGTC CACCCCGGCC
CCGGAGCCCG TGCGGCGGGC GGCGCACGCC ACGCTCGACT CGCACATCCT GGGATACACC
GAGGCGCCGG GGATCCGGCC CCTGCGGGAG GCTATCGCCG GGCACTACCG CGACCGCTAC
GCGCTGGCCG TGGACGCTGA CAACGTCGTC GTCACCACCG GGTCCTCGGG CGGATTCCTG
CTCGCCTTCC TGGCCGCATT CGACGTGGGC AGCCGGGTCG GGTTGGCCCG CCCGGGGTAC
CCGGCCTACC GGAACATCCT GCACGCCCTG GGCTGTGCGG TCGTTGACCT GCCGTGCGGA
CCGGAAACCC GGTACCAGCC GACGGTGTCG ATGGTCCGGG AGCACGACCT GGACGGGTTG
ATCGTGGCCA GCCCGGCCAA CCCGACCGGC ACCATGCTCG AACCGGGGGA GCTGGCGGCA
CTGGCCACCT GGAGCGCCGG AAACGCGGTC CGGCTGGTCT CGGACGAGAT CTACCACGGC
ATCACCTACA CCGGCAGCAC CAGCAGCAGT TGGCAGACCG ACCGGCACGG CATCGTGGTG
AACTCGTTCT CCAAGTACTT CTCGATGACC GGCTGGCGCA TCGGGTGGCT GCTGGTTCCC
GACGATCTGG TCGAGGTGGT CGATGCGCTG GCCGGCAACC TGGCCATCTG CCCGCCCGCC
CCGGCCCAGT ACGCGGCGAT GGCCGCCTTC GAGGCCTACG CCGAATGCGA CGGTCACGTG
CAGCGCTACG CGCAGCACCG CGATCTGCTG CTGGGCGGCC TGCGCCGGCT CGGCTTCGAC
CGGCTCGCCC CGGCCGACGG GGCGTTCTAC GTGTACGCCG ACATCGGCGA CCTGACCTCG
GATTCCACCG CGTTCTGCGC CCGGTTGCTG GCCGAGGCCG GTATCGCCGC CGCCCCGGGC
GTGGACTTCG ACGTCGTCGA CGGCCACCGG TTCCTGCGGT TCTCCTTCGC CGGGTCGATG
CGGACGATCG AGGGTGCGTT GGACGCGCTG GAACGGTTCC TGGCCGGCTG A
 
Protein sequence
MTGSGRRGDE IIGKAAGLGP SHRSRVAPFY VMQVLAAAAR RRAAGEVVWD LAAGQPSTPA 
PEPVRRAAHA TLDSHILGYT EAPGIRPLRE AIAGHYRDRY ALAVDADNVV VTTGSSGGFL
LAFLAAFDVG SRVGLARPGY PAYRNILHAL GCAVVDLPCG PETRYQPTVS MVREHDLDGL
IVASPANPTG TMLEPGELAA LATWSAGNAV RLVSDEIYHG ITYTGSTSSS WQTDRHGIVV
NSFSKYFSMT GWRIGWLLVP DDLVEVVDAL AGNLAICPPA PAQYAAMAAF EAYAECDGHV
QRYAQHRDLL LGGLRRLGFD RLAPADGAFY VYADIGDLTS DSTAFCARLL AEAGIAAAPG
VDFDVVDGHR FLRFSFAGSM RTIEGALDAL ERFLAG