Gene Namu_3806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3806 
Symbol 
ID8449425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4178317 
End bp4179387 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content71% 
IMG OID645042857 
Productpeptidase M14 carboxypeptidase A 
Protein accessionYP_003203093 
Protein GI258653937 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2866] Predicted carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0393058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.437302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTACC GGACCGTCGC CCAACTCGCC GCGGTGCTGA ACCAGGTCAC GGCCGGCGCC 
CCCGAGCTGT GCACGTTGCT GCCCCTGCCC GAGCGGTCGG TCCAGGGATC CGCGGTGTCC
GCACTGCGCA TCGCCGCCGC CGGCCCGGTG CCGGTGGACG AGCGTCCCGG CGTCCTGCTC
ATCGCCGGGA CTCACGCCCG CGAGCTGATG AACCCCGACC TGCTGGTCGA ACTGGCCGTC
GATCTGGTCG CCGCCCAGCG CACCGGGACC GACATCGTGC TCGGCGGCCG GACCTGGCCG
GCCGCCGCGG TCCGGGCGAT GCTGGCCGCC GCGACGGTGT ACCTGTTGCC GTGCGTCAAC
CCGGACGGCC GCACCTACGT GCTCACCGTC GACGACATGT GGCGCAAGAA CCGTCGCGAC
AACCCGGGCA CCACCTGCGA CGGGGTCGAC CTCAACCGCA ACGCGGACAT CCTCTGGGGG
GTTACCGAAG GCCAGACGTC CTGCTCGCCG TGCACCGACA TCTATTGCGG TTCCGGTGCT
TTCAGCGAGC CGGAGAACCG GAACGTCAAG CACCTGCTGG ACACCTACCG CATCGACGCC
TTCGCCGACG TGCATTCGTT CTCCGAGCTG GTCATCTACC CCTGGGGGCA CGCCCCCAGC
CAGACGACGG ACCCCACGCA GAACTTCCGC ACGCTGACCA CCACGACCTG CCGTCCGTTG
AACCGCCCCG GCTACGCCGA GTACATCGCC CCGGCCGATC TGGCCCGGTT CCAGGCGGTG
GCCGGACGGA TCGTCGCCGA GATCGCCGCG GTCCGCGGCC GCCAGTACAG CCCGGAGCCG
GGCATGACGC TCTACCCGAC CACCGGCACC CACAGCGACT ACGCCTGGAG CCGGCACCTG
GCCGACCCGA ACCTGCGCCG CACCGAGGGC TACACCATCG AGACCGGCCC CTCCGGTGAC
GACGCCCGCG AGTCGTTCCA CCCCCGCGAC CCCGAGCCGA TCAAGCGCGA GGTCGAGTCC
GGGCTGCTCG CCCTGATCCA GGCCACCGCC GCCACCGCCA CCCCCGCCTG A
 
Protein sequence
MMYRTVAQLA AVLNQVTAGA PELCTLLPLP ERSVQGSAVS ALRIAAAGPV PVDERPGVLL 
IAGTHARELM NPDLLVELAV DLVAAQRTGT DIVLGGRTWP AAAVRAMLAA ATVYLLPCVN
PDGRTYVLTV DDMWRKNRRD NPGTTCDGVD LNRNADILWG VTEGQTSCSP CTDIYCGSGA
FSEPENRNVK HLLDTYRIDA FADVHSFSEL VIYPWGHAPS QTTDPTQNFR TLTTTTCRPL
NRPGYAEYIA PADLARFQAV AGRIVAEIAA VRGRQYSPEP GMTLYPTTGT HSDYAWSRHL
ADPNLRRTEG YTIETGPSGD DARESFHPRD PEPIKREVES GLLALIQATA ATATPA