Gene Athe_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1541 
Symbol 
ID7409049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1628418 
End bp1631651 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content37% 
IMG OID643715913 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_002573412 
Protein GI222529530 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAGA GAAAAGACAT AAAAAAGGTT TTGATAATAG GCTCTGGTCC GATAGTAATT 
GGGCAGGCTG CTGAGTTTGA CTATTCAGGA ACTCAGGCCT GCCGCGCTTT AAAAGAAGAA
GGAATAGAAG TTGTGCTTGT AAACTCCAAC CCCGCAACAA TCATGACTGA TACAGAAATT
GCTGACAGGG TATATATTGA ACCAATTTCA GTTGACTATA TCGAAGAAAT AATCAAAAAA
GAAAGACCAC AAGGACTTTT GGCTGGACTT GGTGGTCAGA CAGCACTCAA TATGGCATTT
GAGCTTGCCG AAGCAGGAAT TTTGGAAAAG TACGGAGTTT GCCTTCTTGG AACATCGCTT
GAAACAATTA AAAAGGCAGA AGACAGGGAA CTTTTTAAAA AGACCATGAT TGAAATTGGA
GAACCTGTGC CAAAAAGCAT CATAGCACAC TCTGTACAAG AAGCTATAGA ATTCGCAAGA
GAAGTTGGAT ACCCTGTTAT TGTTCGTCCT GCCTATACCC TTGGCGGCAC AGGCGGTGGA
ATTGCTTACA ATGAAGAAGA ATTAAGATAT ATTGCAAGCA AAGGATTGAA ACTTTCTTTA
ATTCATCAAG TATTGATTGA ACAAAGTGTC CTTGGCTGGA AAGAAATAGA ATATGAGGTC
ATGAGGGACA GCAACGACAA TTGCATCACT GTGTGCAACA TGGAAAACAT AGACCCTGTG
GGAATTCACA CAGGTGACAG TATTGTTGTT GCCCCATCTC AAACTCTTTC TGATAAAGAG
TATCAGATGC TGCGAAGTGC TTCTTTAAAC ATAATAAGAA GTCTTAAAAT TGAGGGCGGA
TGCAACGTTC AGTTTGCACT AAATCCAAAC AGTATGGAAT ATGTGGTAAT TGAGGTAAAT
CCAAGAGTGA GCCGTTCGTC TGCCTTAGCA TCAAAAGCAA CAGGATATCC TATTGCCCGA
ATTGCTGCAA AAATAGCAAT TGGGCTTACA CTTGATGAAA TAATAAATCC TATCACCCAA
AACACATATG CAAGCTTTGA ACCGTCTATA GACTATGTTG TTGTAAAAGT GCCCAGATGG
CCGTTTGACA AGTTCGAAAA GGCAGACAGA CGACTTGGCA CACAAATGAA GTCAACTGGC
GAGGTCATGG CAATTGGAAG AACATTTGAA GAAGCATTTT TAAAAGCAAT AGATTCACTG
GATGTCAAGA TTAATTATCA GCTCGGTCTT AAGAAATTTG AAGAAATGCC AGATGATCAG
CTTTTAGAGT ATATAAAAAC TCCAAACGAT GAGAGAGTTT TCGCAATATG CGAAGCTCTT
TCTCGAAATT ATGACTGCAA GTTCATCTCA GACCTCAGCA AGATTGATTA CTTCTTTATT
GAAAAGTTCA AAAACATAGT TGATATGTCA AAACAGCTCA AAAAATATGA CATTGAATCA
CTGCCATATG ACCTTTTACA AAAAGCTAAA AGGCTTGGGT TTGGTGACTC ATACATTGCA
AATCTTTTAA AAGAAGATGT GGACGAGGTA ATAGAAATAA GAGAGAAATG TAAGCTAAAA
CCTTCTTTCA AGATGGTTGA CACCTGTGCA GGTGAGTTTG AAGCAAAAAC ACCATATTTT
TATTCCACAT ACGAAAAAGA AACTGACTTA GTTGTAAGTT CAAAGCCCAA GGCAATTGTA
ATAGGTTCAG GACCAATTCG AATTGGTCAG GGAATTGAAT TTGATTATTG CTGTGTTCAT
TCAATATTCG CGTTAAAAGA AGAAGGAGTT GAAGCTATAA TCATCAATAA CAATCCTGAA
ACAGTATCAA CTGACTTCGA TACATCAGAC AAGCTCTTTT TTGAACCTCT TACAAAAGAA
TGTGTTTTAG ACATAATAAA ACAGGAAAAA CCAATGGGCG TAATAGTTCA ATTTGGTGGT
CAGACTGCAA TAAATATGGC TTCATACCTT GCAAAAAACG GTGTGAAGAT TTTAGGAACT
TCTATGGAAA GTATTGACAC TGCAGAGGAT AGAGACAAGT TTTTGAATCT TCTTAAAAAC
CTCAATATTC CTTATCCTCC AGGTGGGGCT GCATACAGCT TGGAAGATGC GGTAAAGGTT
GCTCAGCAAA TTGGCTATCC TGTTCTGGTA AGACCATCTT ATGTTCTTGG CGGAAGGGCA
ATGGAGATTG TCTACAGCCG CGAGGAGCTT GAAAAATATA TCAAAGCTGC AATTGAGATA
TCAATAAAGC ATCCTATTTT GATCGACAAA TACATCCTTG GAAAAGAAGC GGAAGTTGAT
GGAATCTCGG ATGGAGAAGA TGTGTTAATA CCTGGAATCA TGGAACATAT TGAAAGAGCT
GGTGTTCATT CTGGCGACAG CATGGCAGTA TTCCCACCCC ATACACTCTC TGAGAGGGTT
AAAGAAAAAA TTGTTGATTA TACTATAAAA CTTGCCCGCG CCTTGAGAGT TGTAGGACTT
TTCAATATTC AATTTGTAAT TGATAAAGAT GAAAATGTAT ATGTAATAGA AGTAAATCCT
CGCGCAAGCA GAACTGTGCC AATTTTGAGC AAGGTTACGG GAATTCCTAT GATAAAGATT
GCAACTAAAC TCATACTTGG CAAGAAGCTC AAAGATTTGG GATACCAAAC TGGCCTTGTA
AAAGAACCAG ACTTTTTTGC AGTAAAAGCT CCTGTATTTT CGTTTTCCAA ATTATCAAAG
GTTGATGCAT ATTTAGGTCC AGAGATGAAA TCCACTGGCG AAGTTCTTGG TATCTCCAAA
AACCTAAAAG TTGCTCTTTA TAAAGCTTTT ATATCCTCCA ATCATAAATT TACAAAAAAC
GGAAGCTGCT TGATTTTAGC ACCTGAAAGC GAAAGGGATG CTATACAGCA GATCATAAGA
AAACTATATG AAGTAAACTT CAAAGTGTTC CTCTTAGATA GCATGAAGGA TTATATAAAA
GGCTTAAATG TAGAGTTTAT AAACAAAGAG ACTGCTCAAA AGTTGCTACT TGAAGACAAA
TTCTCGTTTG TAATAAATAT TCCATCCAAA GACAAAATGC AAGAGTTTGG TTTTGTTTTG
CGAAGGCTTT CTGTTGAGTT TGGAATCACA ACATTGACAT CGATTGACAC AGCACTGTAT
TATGTGGACG TTTTATCTTC ACTGGATGAG ATTGAAAAAG ATATCTATTG CTTGAATGAT
TTATTTAAAG ATGAAAGGAT GAAGTGTTAT GAGACATTTT CTTCATCTAA ATGA
 
Protein sequence
MPKRKDIKKV LIIGSGPIVI GQAAEFDYSG TQACRALKEE GIEVVLVNSN PATIMTDTEI 
ADRVYIEPIS VDYIEEIIKK ERPQGLLAGL GGQTALNMAF ELAEAGILEK YGVCLLGTSL
ETIKKAEDRE LFKKTMIEIG EPVPKSIIAH SVQEAIEFAR EVGYPVIVRP AYTLGGTGGG
IAYNEEELRY IASKGLKLSL IHQVLIEQSV LGWKEIEYEV MRDSNDNCIT VCNMENIDPV
GIHTGDSIVV APSQTLSDKE YQMLRSASLN IIRSLKIEGG CNVQFALNPN SMEYVVIEVN
PRVSRSSALA SKATGYPIAR IAAKIAIGLT LDEIINPITQ NTYASFEPSI DYVVVKVPRW
PFDKFEKADR RLGTQMKSTG EVMAIGRTFE EAFLKAIDSL DVKINYQLGL KKFEEMPDDQ
LLEYIKTPND ERVFAICEAL SRNYDCKFIS DLSKIDYFFI EKFKNIVDMS KQLKKYDIES
LPYDLLQKAK RLGFGDSYIA NLLKEDVDEV IEIREKCKLK PSFKMVDTCA GEFEAKTPYF
YSTYEKETDL VVSSKPKAIV IGSGPIRIGQ GIEFDYCCVH SIFALKEEGV EAIIINNNPE
TVSTDFDTSD KLFFEPLTKE CVLDIIKQEK PMGVIVQFGG QTAINMASYL AKNGVKILGT
SMESIDTAED RDKFLNLLKN LNIPYPPGGA AYSLEDAVKV AQQIGYPVLV RPSYVLGGRA
MEIVYSREEL EKYIKAAIEI SIKHPILIDK YILGKEAEVD GISDGEDVLI PGIMEHIERA
GVHSGDSMAV FPPHTLSERV KEKIVDYTIK LARALRVVGL FNIQFVIDKD ENVYVIEVNP
RASRTVPILS KVTGIPMIKI ATKLILGKKL KDLGYQTGLV KEPDFFAVKA PVFSFSKLSK
VDAYLGPEMK STGEVLGISK NLKVALYKAF ISSNHKFTKN GSCLILAPES ERDAIQQIIR
KLYEVNFKVF LLDSMKDYIK GLNVEFINKE TAQKLLLEDK FSFVINIPSK DKMQEFGFVL
RRLSVEFGIT TLTSIDTALY YVDVLSSLDE IEKDIYCLND LFKDERMKCY ETFSSSK