Gene Emin_1032 details


Gene Information

Locus tag: Emin_1032
Symbol:
ID: 6263165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1122888
End bp: 1124060
Gene length: 1173 bp
Protein length: 390 aa
Translation table: 11
GC content: 42%
IMG OID: 642611512
Product: carbamoyl-phosphate synthase, small subunit
Protein accession: YP_001875922
Protein GI: 187251440
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0505] Carbamoylphosphate synthase small subunit
TIGRFAM ID: [TIGR01368] carbamoyl-phosphate synthase, small subunit
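
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record itself does not state either convention.

```python
# Values copied from the record.
start_bp = 1122888
end_bp = 1124060

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1173
print(protein_length)  # 390
```

Under these assumptions both results agree with the listed gene length (1173 bp) and protein length (390 aa).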


Plasmid Coverage information

Number of covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAAAAA AACTTAAAAA AGAATATAAA AAAGCCGTAT TAGAGTTATC TAACGGTATA 
AAGATGGAAG GACGTCTTAT CGGAGCGGAC GCAATGGTAA GCGGAGAGAT GGTTTTTTCA
ACCGGAATGC TTGCCTACAG CGAAGCGATG ACAGACCCTT CCTATCTCGG ACAAATTTTA
GTCTTTAGCT TTTCATTAAT TGGAAACTAC GGAATCCCCT CTTTTAAAGA AGGGGATTTT
TTTATGCCGC ACGGTTATGA AAGCGCCGGC ATTAAAACGC AGGGCATAAT AGTGTCCGAT
ACTTTTGACG ACTGTTTTCA CTACGAAAAA GGCAACAACA TTAAAACTTG GATGAAGGAT
AACGGCGTGC CCGGCATAGC CGGTATAGAC ACAAGATACC TTGTGCAAAT GATACGCGAC
TCTAAGGGGC CTTTATTCGG CCGCATAGTG CCGGAAGGCA AAAGCTCTTC TTACAACAGC
ACCAAGTTTG AGTTTTTAAA ACATTTTAAG AAAACGGATT ATGTGGACCC GTCCAAATAT
AATCTTTTGC CTTCGGCTTC GGTTAAAAAA CCTGTTACTC TTGGCAAAGG CGATATCAAA
ATAGCGCTGT TAGATTTTGG CGTAAAAAGA AATATTATAA GAATATTTAC GGACTACGGC
TGCACGGTAA CGGTTTACCC CTGGGATACA GATGTTGACA CTGTTGAAAC GGACGCATGG
GTGCTTAGCA ACGGCCCCGG CGACCCTAAA CAAACGGGCG ATTTAATACA AAGAGTAAAA
AAACTTATTA AAGGCGATAA ACCGATACTC GGCATCTGTT TAGGCCACCA GGTTTTGGCT
TTGGCGGCGG GCGCAAAAAC AAAAAAATTA AAACGCGGAC ACCGCAGTTT TAACCAGCCT
GTTTTTGACG TTAAAACCAG AAAAGCCTTT ATGAGCAGCC AAAACCATAG TTTTGAGGTG
GACAAAGCCT CCCTCCCCAA AGAATGGGAA GTGTGGTTTG AAAACGCTAA CGATTTTACC
ATTGAAGGCC TTAAACACAA AACAAAACCT TTTATGACGA CGCAGTTCCA CCCGGAAGCC
TCAGGCGGGC CCAATGACAC GGCCTGGGTT ATAAAAGATT TTGTTTCCCT TATAAAAAAA
TCGAATAAGA CTGTAAAAAA AGGAAAGAAA TAA
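
The gene sequence is printed in space-separated blocks of 10 bases, 60 bases per row. A minimal sketch of how such a block-formatted sequence could be cleaned up and its GC content computed is given below. The function name `gc_percent` is hypothetical, and the record does not say how its own GC figure was calculated or rounded, so results may differ slightly from the listed value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used for display formatting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "ATGATAAAAA AACTTAAAAA AGAATATAAA AAAGCCGTAT TAGAGTTATC TAACGGTATA"

print(len("".join(first_row.split())))  # 60
print(gc_percent(first_row))
```

Passing the full 1173-base sequence to the same function gives a value that can be compared with the GC content field of the record.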
 
Protein sequence
MIKKLKKEYK KAVLELSNGI KMEGRLIGAD AMVSGEMVFS TGMLAYSEAM TDPSYLGQIL 
VFSFSLIGNY GIPSFKEGDF FMPHGYESAG IKTQGIIVSD TFDDCFHYEK GNNIKTWMKD
NGVPGIAGID TRYLVQMIRD SKGPLFGRIV PEGKSSSYNS TKFEFLKHFK KTDYVDPSKY
NLLPSASVKK PVTLGKGDIK IALLDFGVKR NIIRIFTDYG CTVTVYPWDT DVDTVETDAW
VLSNGPGDPK QTGDLIQRVK KLIKGDKPIL GICLGHQVLA LAAGAKTKKL KRGHRSFNQP
VFDVKTRKAF MSSQNHSFEV DKASLPKEWE VWFENANDFT IEGLKHKTKP FMTTQFHPEA
SGGPNDTAWV IKDFVSLIKK SNKTVKKGKK
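
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the understanding that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each
# position (the conventional compact layout of the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Strip display whitespace; ambiguous bases such as N raise KeyError.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence in this record.
print(translate("ATGATAAAAA AACTTAAAAA AGAATATAAA"))      # MIKKLKKEYK
print(translate("TCGAATAAGA CTGTAAAAAA AGGAAAGAAA TAA"))  # SNKTVKKGKK
```

The two outputs match the first and last 10 residues of the protein sequence listed in the record. The last row can be translated on its own because the 1140 bases before it are a multiple of three, so it begins on a codon boundary.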