Gene Namu_2694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2694 
Symbol 
ID8448306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2949853 
End bp2950935 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content63% 
IMG OID645041786 
Productprotein of unknown function DUF955 
Protein accessionYP_003202029 
Protein GI258652873 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2856] Predicted Zn peptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00279069 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0238985 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCAA GACAGTCGAT GGTGACCCTC ATGCGCGAAG CTCGTGGTTG GACGCAGACG 
CAGCTCGCCG CGAACGCGGT CATGAGCCAG GCCGTCATCT CCAAGGTGGA GACCGGAGCC
TTGGAACTCG ATCCAGAGCG CCTGGCTCGC CTCGCCCACG CCCTCGACTG CCCTCCCGAT
TTTCTTGAGC GAAGCGCCGA CCTGCCGTCG ATCGAGATCA CCTGCCTGCA CCGCCGGCGG
GCCAGCACGA TGACTGTGAA CACGATGAAG CGCATTGAGG CCGTTACCCA CCTCTCACGA
ATCAGCGTAG AGGGGCTCCT CTCCGGCATC GAACTCGCAC CCGCAAGGAC GTTCGAGCGT
ATCGAGATAA TGGATGACCG CGGACCGGCC GCGATCGCGG GCGAACTTCG AAGACGCTGG
AGCACGCCAA ACGGCCCGAT CAGAAACCTG ATCGGCCTCG TGGAATCTGC GGGAGTCGTC
ATTGTCTTCC GATCGTTCGG CACAACCGGG CAGGACGCAG TGAGCACATG GCCGGAGGAT
CCGGGCCGCC CACCTATGAT GCTCATTAAT ACTGATCTGC CCTCCGACCG CCTCCGCTTC
ACGGTGGCGC ACGAGTTCGG CCATCTGGTC ATGCACAGAC TGCCAACTGA CAATCAGGAG
GCTGAAGCGA ATTCGTTTGC CGGCGAGTTC CTGGCTCCGG CGGACGAGAT CCGTCACGAA
CTGGAGGGCC TCAGGACCAG CGACTTCCGT CGGCTGATGG CCCTCAAGAT CGAATGGGGA
ATGTCAATGG CCGCGCTCAT CCGGCGCGCG CACGATCTGG AGACGATCAC TGACCGGCAG
TACCGCGAGT TTCAGGTTCG GCTTGGGAAG CTCGGGTGGC GCACCAGCGA GCCAGGCGAC
GTTGCACGTG AGAGCCCTTC AATCGTCAAC AAGATCATTG CGCTGCAGCG GCGTGAGCAC
GAATACTCGG ATGACGAACT CGCCCGTCTC GCAGGGATGA CTGAACCCGC GTTCCAGCGG
TACTTCCTCG CAGACCCGGG TAGTTCCGGC CAGCCCCCGC TCAGATTGGA CCTCCATGAG
TAA
 
Protein sequence
MAARQSMVTL MREARGWTQT QLAANAVMSQ AVISKVETGA LELDPERLAR LAHALDCPPD 
FLERSADLPS IEITCLHRRR ASTMTVNTMK RIEAVTHLSR ISVEGLLSGI ELAPARTFER
IEIMDDRGPA AIAGELRRRW STPNGPIRNL IGLVESAGVV IVFRSFGTTG QDAVSTWPED
PGRPPMMLIN TDLPSDRLRF TVAHEFGHLV MHRLPTDNQE AEANSFAGEF LAPADEIRHE
LEGLRTSDFR RLMALKIEWG MSMAALIRRA HDLETITDRQ YREFQVRLGK LGWRTSEPGD
VARESPSIVN KIIALQRREH EYSDDELARL AGMTEPAFQR YFLADPGSSG QPPLRLDLHE