Gene Namu_4703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4703 
Symbol 
ID8450333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5231604 
End bp5232995 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content67% 
IMG OID645043743 
Productcitrate synthase I 
Protein accessionYP_003203968 
Protein GI258654812 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGG CCACCGACGC CGCCCCGACC CCGGCGAACC CGCCCGGATC CGCCGAGGCG 
ACGGGCACCA TCCAGTTCAT GCCGCCGACC GAGGCCACGC TCGAGGCGCC GGCCACCGGC
TCCCTGGAGT ACGGCGGGCA GAAGCTGGAC CTGAAGGTGA TCCCGGCCAC CGAGGGCGCC
TCCGGCATGG AGATCTCCAA GCTGCTGACC ACCCTGGGTG TCATCACCCT GGATCCGGGG
TTCACCAACA CCGGATCGAC CACGTCCAAG ATCACCTACA TCGACGGTGA CGTCGGCATC
CTGCGCTACC GCGGGTACCC GATCGAGCAG CTGGCCGAGC ACTCCACGTT CCTGGAGACC
AGCTACCTGC TCATCCACGG TGAGCTGCCC ACGACCGCGG AGCTGGACTC GTTCACCAAG
CGGATCAGCC GGCACACCAT GCTGCACGAG GATCTCAAGC GGTTCTTCGA CGGCTTCCCC
CGGGACGCGC ACCCGATGCC GGTGCTTTCC AGCGCGGTCA GCGCGCTGTC GACGTTCTAC
CAGGACTCGC TGGACCCGTT CAATCCCGAG CAGGTCGAGC TGTCCACCGT GCGGCTGCTG
GCCAAGCTGC CCACCATCGC CGCGTACGCC TACCGCAAGT CGGTCGGCCA CCCGTTCCTG
TACCCGGACA ACTCGTTGAG CCTGGTCGAG AACTTCCTGC GGATGTCGTT CGGCTTCCCG
GCCGAGCCGT ACGAGGTCGA TCCCAAGCTG ACCAAGGCGC TCGACCAGCT GCTGATCCTG
CACGCAGACC ACGAGCAGAA CTGCTCCACC TCGACCGTGC GGCTGGTCGG CTCGTCCAAC
GCCAACCTGT TCGCCTCCGT CTCGGCCGGC ATCAACGCCC TGTTCGGCCC GCTGCACGGC
GGCGCCAACC AGGCCGTGCT GGAGATGCTG GAGGGCATCA AGAAGGACGG CGGTGACGTC
GGCCACTTCG TCAAGCGGGT CAAGGATCGC GAGCCCGGCG TCAAGCTGAT GGGCTTCGGG
CACCGGGTCT ACAAGAACTA CGACCCGCGT GCGGCCCTGG TCAAGGCCAC CGCCGACGAG
GTGCTGGCCT CCCTGGGCGC CCAGGACCAG CTGCTCGACC TGGCCAAGCA GCTGGAAGAG
GTGGCGTTGT CCGACGACTA CTTCATCTCC CGCAAGCTGT ACCCGAACGT GGACTTCTAC
ACCGGCCTGA TCTACAAGGC GATGGGCTTC CCGACCCGGA TGTTCACCGT GCTGTTCGCG
CTGGGCCGGC TGCCCGGCTG GATCGCCCAG TGGCGCGAGA TGATCAACGA CCCGGCCACC
AAGATCGGCC GCCCGCGGCA GGTCTACACC GGGTACACCG AGCGGGACTA CATCCCCACC
GAGCAGCGCT GA
 
Protein sequence
MTKATDAAPT PANPPGSAEA TGTIQFMPPT EATLEAPATG SLEYGGQKLD LKVIPATEGA 
SGMEISKLLT TLGVITLDPG FTNTGSTTSK ITYIDGDVGI LRYRGYPIEQ LAEHSTFLET
SYLLIHGELP TTAELDSFTK RISRHTMLHE DLKRFFDGFP RDAHPMPVLS SAVSALSTFY
QDSLDPFNPE QVELSTVRLL AKLPTIAAYA YRKSVGHPFL YPDNSLSLVE NFLRMSFGFP
AEPYEVDPKL TKALDQLLIL HADHEQNCST STVRLVGSSN ANLFASVSAG INALFGPLHG
GANQAVLEML EGIKKDGGDV GHFVKRVKDR EPGVKLMGFG HRVYKNYDPR AALVKATADE
VLASLGAQDQ LLDLAKQLEE VALSDDYFIS RKLYPNVDFY TGLIYKAMGF PTRMFTVLFA
LGRLPGWIAQ WREMINDPAT KIGRPRQVYT GYTERDYIPT EQR