Gene Namu_1578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1578 
Symbol 
ID8447176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1738761 
End bp1740374 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content72% 
IMG OID645040705 
ProductCBS domain containing protein 
Protein accessionYP_003200962 
Protein GI258651806 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.110465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.116098 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGTTGC TGGTGATCGC GGCTGCTCTG GTGCCGATCG CCGGCCTGCT GGCCGCGGCC 
GACGCCGCGA TCACCATGGT CTCGCCGGCC CGGTTGGAGG AGCTGGCCCG CGAGGACCGG
CGCGGCGCGT CGTCGCTGCT GACCATCACC GAGGACCGGC CCCGGTACAC CAACCTGCTG
TTGCTGCTGA GGATCACCGC CGAGATCACC GCGACCGTGA TGGTGGCCAA GGTCGCGTTG
ACCACCCTGG GCTTCCAGTT CTGGGTCGGC CTGCTGCTGG TGGCGGTGAT GGTCGTGGTC
ACCTACGTGG GGATCGGCGT GCTGCCGCGC ACCATCGGCC GCCAGCACCC CTACCCGGTG
GGTCTGGCCC TGGCCGGCCC GACCCGGGCG CTGGCCCGGC TGCTCGCGCC GGTCGCCTCG
CTGCTGATCC TGATCGGCAA TGCGATCACC CCCGGCCGCG GCTTCCGGGA GGGCCCGTTC
TCCTCCGACA TCGAGCTGCG GGAGCTGGTC GACATCGCCG GCAGCCGCGG CGTGGTCGAG
GAGACCGAGC GGGAGATGCT GCAGAGTGTC TTCGATCTGG GCGACACCAT CGTGCGCGAG
GTGATGGTGC CGCGCACCGA GACGGTGTGG ATCGAACGGG ACAAGACGCT GCGCCAGGCG
CTGGCCCTGG CCAGCCGCTC GGGCATGTCC CGGATCCCCG TGGTCGGCGA GGACCTGGAC
GACATCGTCG GCGTCGCCTA CCTCAAGGAC CTGATCGCGC CGGCGATGAA CCTGGCCCCG
GACGACCAGG GTCCCGTGCT GACCCAGATC ATGCGGGAGC CGGTGTTCGT GCCCGAGTCC
AAGAACGTCG ACGACCTGCT GCGCGAGATG CAGCGCGATC GGACCCACTT CGCCGTCGTG
GTGGACGAGT ACGGCGGCAC CGCCGGCATC GTCACCATCG AGGACATCCT GGAGGAGATC
GTCGGCGAGA TCACCGACGA GTACGACGCG GACACCCCGG CCCCGGTGGT GCCGCTGCCC
GACGGCTCGT TCCGGGTCTC GGCCCGGTTG CCGGTCGAGG ACCTGGGAGA GCTGTTCGAC
GTCGAGCTCG ACGACGACGA GGTGGACACC GTCGGTGGGC TGCTGGCCCA GCAGCTCGGC
CGGGTCCCGC TGCCCGGCTC GGAGGTCGGC GTCGCCGGGC TGTTCCTGCA CGGGGAACCC
GGGGTGGACC GCCGGGGCCG GCCGCGGGTG CAGACCGTGC TGGTGCGCCG GCTGACCGCG
GCCGAGCTGG CCCAGGAGCA GGCCGAACGC GACCGCGAGG TGCGGGAGCG GGAGGACCAG
CGGCGGGCGC AGGAGCAGGA ACCGGAACGG CCCGCGGTCG AGGAGTCCGA CGCCGAACCG
GCCGAGGCCA AGCGGGCCAA ACCGCGCAAG AAAAAGAAGA AGGCCAAACC GCCGGCCGCC
GACGAGTCCG CCGAGCCCGG GGCGACGGAT TCCCCGGGTG ACTCCCCGCC GGACTCGCCG
GCCGGCGCCC CGCCGGGCCG CTCGACGCCG GCGGACCCGC CGCTGCTCGG ACCGCTGATC
GAGACCAGGA ACCAGCCAGT GCACGATTCG ACGGAACGGA ACCGAGCCGG ATGA
 
Protein sequence
MLLLVIAAAL VPIAGLLAAA DAAITMVSPA RLEELAREDR RGASSLLTIT EDRPRYTNLL 
LLLRITAEIT ATVMVAKVAL TTLGFQFWVG LLLVAVMVVV TYVGIGVLPR TIGRQHPYPV
GLALAGPTRA LARLLAPVAS LLILIGNAIT PGRGFREGPF SSDIELRELV DIAGSRGVVE
ETEREMLQSV FDLGDTIVRE VMVPRTETVW IERDKTLRQA LALASRSGMS RIPVVGEDLD
DIVGVAYLKD LIAPAMNLAP DDQGPVLTQI MREPVFVPES KNVDDLLREM QRDRTHFAVV
VDEYGGTAGI VTIEDILEEI VGEITDEYDA DTPAPVVPLP DGSFRVSARL PVEDLGELFD
VELDDDEVDT VGGLLAQQLG RVPLPGSEVG VAGLFLHGEP GVDRRGRPRV QTVLVRRLTA
AELAQEQAER DREVREREDQ RRAQEQEPER PAVEESDAEP AEAKRAKPRK KKKKAKPPAA
DESAEPGATD SPGDSPPDSP AGAPPGRSTP ADPPLLGPLI ETRNQPVHDS TERNRAG