Gene Athe_1540 details


Gene Information

Locus tag: Athe_1540
Symbol:
ID: 7409048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1627341
End bp: 1628411
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 36%
IMG OID: 643715912
Product: carbamoyl-phosphate synthase, small subunit
Protein accession: YP_002573411
Protein GI: 222529529
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0505] Carbamoylphosphate synthase small subunit
TIGRFAM ID: [TIGR01368] carbamoyl-phosphate synthase, small subunit
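
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1627341
end_bp = 1628411
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under those assumptions the reported 1071 bp and 356 aa agree with the coordinates.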


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG CGGTACTTGT TTTAGAAGAC GGGCTGACAT TTGAAGGAAT TTCACTTGGT 
AGTGAAGGTG AAACAATTGG TGAAATTGTA TTCAATACAT GCATGACAGG ATACCAAGAG
GTACTGACTG ACCCTTCCTA CAATGGTCAG ATTGTCACAA TGACCTACCC TCTTATAGGC
AATTACGGAA TAAATGGTGA AGATGTGGAG TCATACAAAC CTCATGTTGA AGGTTTTATT
GTAAGAGAGG CATGTAAAAC ACCTTCAAAT TTCAGGGCTC AGAAGACATT GCATGAATAT
TTAAAAGAGA ATAATATTGT GGCAATTGAG GGTGTTGATA CACGAGCTAT AACTGAACAC
ATACGAAATA AAGGCTCAAT GCTTGGTATT ATCTCAACAG AGACCGACAA TAAAGAGATA
CTTTTGAGCA AAATATTTGA GTATAAAAAA CAAAAGCCAT CTTTGGTAAA AGAAGTTTCG
ACCACTCAGG TATACAGAAT TGAAGGCAGT GGCAAGAAGG TTGCTGTTTT AGACTTTGGA
ATAAAGCAAA ACATCTTGAG AGAACTTTCA AAAAGAGGGC TTGATTTATA TGTGTTTCCG
TATAATAGCT CTCTTGACCA AATCATGAAT ATTAATCCAG AAGGGTTTGT GTTCTCAAAC
GGACCGGGTG ATCCTACTGA CTTAGAAGAA TTCTTCCCCA CTTTGAAACA AATAATAGAG
CTAAAAAAAC CTATCCTTGG AATCTGTCTT GGCCATCAGC TTTTAGGTTT GTGTTTAGGA
CTTAAGACAT ACAAGCTGAA ATTTGGTCAC CATGGTGGAA ACCACCCGGT TAAAGATTTA
ATTTCCGGCA AGGTTTATAT AACCTCACAG AATCATAACT ATGCAATTGA ATATAAAGAA
TATGAAAAAA TAAAAATCAC ACATGTAAAT GTAAATGATA AAACTGTTGA GGGGTTTGCT
CATCTTGATT TGCCTATAGT ATCTGTTCAA TACCATCCTG AGGCAAGCCC GGGTCCACAT
GACTCAAAAT ATATATTTGA CCAATTTACT AAGCTTTTGA ATGGAGTGTA G
 
Protein sequence
MKKAVLVLED GLTFEGISLG SEGETIGEIV FNTCMTGYQE VLTDPSYNGQ IVTMTYPLIG 
NYGINGEDVE SYKPHVEGFI VREACKTPSN FRAQKTLHEY LKENNIVAIE GVDTRAITEH
IRNKGSMLGI ISTETDNKEI LLSKIFEYKK QKPSLVKEVS TTQVYRIEGS GKKVAVLDFG
IKQNILRELS KRGLDLYVFP YNSSLDQIMN INPEGFVFSN GPGDPTDLEE FFPTLKQIIE
LKKPILGICL GHQLLGLCLG LKTYKLKFGH HGGNHPVKDL ISGKVYITSQ NHNYAIEYKE
YEKIKITHVN VNDKTVEGFA HLDLPIVSVQ YHPEASPGPH DSKYIFDQFT KLLNGV
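
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch, not the tool that produced this record, of how one might clean the gene sequence, compute GC content, and translate it. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons), and it demonstrates only the first line of the gene sequence.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks and return upper-case letters."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_line = "ATGAAAAAAG CGGTACTTGT TTTAGAAGAC GGGCTGACAT TTGAAGGAAT TTCACTTGGT"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MKKAVLVLEDGLTFEGISLG
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` would give a value to compare against the reported GC content field.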