Gene Namu_5389 details

Gene Information

Locus tag: Namu_5389
Symbol:
ID: 8451022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 6028270
End bp: 6029355
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 73%
IMG OID: 645044419
Product: 3-carboxymuconate cyclase-like protein
Protein accession: YP_003204641
Protein GI: 258655485
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2706] 3-carboxymuconate cyclase
TIGRFAM ID: [TIGR02588] conserved hypothetical protein TIGR02588
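
The coordinate and length fields in this record are mutually consistent, which a short check makes explicit. This is a minimal sketch that assumes the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(6028270, 6029355)
print(length)                     # 1086
print(protein_length_aa(length))  # 361
```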


Plasmid Coverage information

Num covering plasmid clones: 101
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGAGT TCCTGATCGG CTGCTACACC TTCGGCGCCG GCGGCCGCGG CCGGCGGATC 
GAAGTGGTCG CGCACGATCC GCAGACCGGC CGGTGGGAGC TGCTCACGGA CCGGGACCTG
GCCGAACCGG TGGCCGGTGC GTCCGCGACC GAACAGCCCG AGTCGCCGTC GTACCTGGCC
TGGCATCCGG ACGGTTGGCA CGTGTACGCG GCCGGTGAGG TCTCCGACGG GCAGGTCTGG
GCGCTCGCCG TCGCCCCCGG TGGGCACGGG TTGTCGGTGC TTGGTTCGGC CTCCACCGGC
GGCGCGCACC CGTGCCACCT GGCCGTCGAC CCGTCCGGCC GGGTGCTGCT GACCGCCAAC
TACACCTCGG GCAGCATCGC GGTTCATCGG CTGACGCCGG ACGGCCGGAT CGGCAAGTTG
AGTCAGCTCG TCGAGCACGA GGGTTGCGGC CCGCACCCGG ACCGGCAGCA GGGCCCGCAC
GCGCACATGG TCCACGTGCT GGATCAGACC CTGGTGCTCG CCGTCGATCT CGGTGCGGAT
CTGATCGTCG CCTACCGGCT CGACCCGGCG GCCGCCACCC TGACCCCGCT GATGACGTCG
CCGCTGCCGG CCGGGTTCGG ACCCCGCCAT CTGGTCGCTC TGGACCACGA CCGGGTCGCC
GTCGCCGGGG AGCTGACCGG TGAGCTGGCG CTGCTGCGGC TGGACCGGCA GACCGGGCAG
CTCGAGCTGC TGGACCTGCA GAGCGGCTCC GACGTGGCCG AGGAACCGGC CGCCCCCAGC
GGCGTCGGGC GCACCGCCGA CGGGCAGTTC GTGCTGATGG CCAACCGGGG CCCGAACACC
GTCGCGGCCT TCCGGGTGGA CGGCCAGAGC ATTCAGCTGG TCGACGAGAT CGGCTGCGGC
GGCGACCATC CCCGGGACCT GACGGTGGTC GGCGACCTGG TGTACGTGGC CAACCAGGAG
AGCGACTCGC TGACCGTGCT GCGGATCGAT CCCGAGACCG GCGCCCTGAG CGCCACCACG
AGTACGTTCA ACACCCCCAG CCCCACCCAG CTGCTGGCCG TGCCCGGCCC GCACGACGGG
AGATGA
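
The gene sequence is printed in blocks of ten bases, so it has to be joined before any computation such as the GC content reported in the record. The sketch below shows one way to do that; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, used as a small example input.
first_row = "ATGCACGAGT TCCTGATCGG CTGCTACACC TTCGGCGCCG GCGGCCGCGG CCGGCGGATC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Running the same two functions over all rows of the gene sequence gives the value to compare against the GC content field.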
 
Protein sequence
MHEFLIGCYT FGAGGRGRRI EVVAHDPQTG RWELLTDRDL AEPVAGASAT EQPESPSYLA 
WHPDGWHVYA AGEVSDGQVW ALAVAPGGHG LSVLGSASTG GAHPCHLAVD PSGRVLLTAN
YTSGSIAVHR LTPDGRIGKL SQLVEHEGCG PHPDRQQGPH AHMVHVLDQT LVLAVDLGAD
LIVAYRLDPA AATLTPLMTS PLPAGFGPRH LVALDHDRVA VAGELTGELA LLRLDRQTGQ
LELLDLQSGS DVAEEPAAPS GVGRTADGQF VLMANRGPNT VAAFRVDGQS IQLVDEIGCG
GDHPRDLTVV GDLVYVANQE SDSLTVLRID PETGALSATT STFNTPSPTQ LLAVPGPHDG
R
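
The protein sequence can be cross-checked against the gene sequence by translating it with the listed translation table. The following is a minimal sketch, not the database's own method: it builds the codon table from the NCBI-style amino acid string for table 11 (whose codon assignments match the standard code), stops at the first stop codon, and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; bacterial table 11 uses the
# same codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence up to, but excluding, the first stop."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCACGAGT TCCTGATCGG C"))  # MHEFLIG
print(translate("AGATGA"))                   # R
```

The two sample inputs are the first seven codons and the final six bases of the gene sequence; their translations match the start (MHEFLIG) and the last residue (R) of the protein sequence.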