Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1541 |
Symbol | |
ID | 7409049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1628418 |
End bp | 1631651 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643715913 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_002573412 |
Protein GI | 222529530 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAGA GAAAAGACAT AAAAAAGGTT TTGATAATAG GCTCTGGTCC GATAGTAATT GGGCAGGCTG CTGAGTTTGA CTATTCAGGA ACTCAGGCCT GCCGCGCTTT AAAAGAAGAA GGAATAGAAG TTGTGCTTGT AAACTCCAAC CCCGCAACAA TCATGACTGA TACAGAAATT GCTGACAGGG TATATATTGA ACCAATTTCA GTTGACTATA TCGAAGAAAT AATCAAAAAA GAAAGACCAC AAGGACTTTT GGCTGGACTT GGTGGTCAGA CAGCACTCAA TATGGCATTT GAGCTTGCCG AAGCAGGAAT TTTGGAAAAG TACGGAGTTT GCCTTCTTGG AACATCGCTT GAAACAATTA AAAAGGCAGA AGACAGGGAA CTTTTTAAAA AGACCATGAT TGAAATTGGA GAACCTGTGC CAAAAAGCAT CATAGCACAC TCTGTACAAG AAGCTATAGA ATTCGCAAGA GAAGTTGGAT ACCCTGTTAT TGTTCGTCCT GCCTATACCC TTGGCGGCAC AGGCGGTGGA ATTGCTTACA ATGAAGAAGA ATTAAGATAT ATTGCAAGCA AAGGATTGAA ACTTTCTTTA ATTCATCAAG TATTGATTGA ACAAAGTGTC CTTGGCTGGA AAGAAATAGA ATATGAGGTC ATGAGGGACA GCAACGACAA TTGCATCACT GTGTGCAACA TGGAAAACAT AGACCCTGTG GGAATTCACA CAGGTGACAG TATTGTTGTT GCCCCATCTC AAACTCTTTC TGATAAAGAG TATCAGATGC TGCGAAGTGC TTCTTTAAAC ATAATAAGAA GTCTTAAAAT TGAGGGCGGA TGCAACGTTC AGTTTGCACT AAATCCAAAC AGTATGGAAT ATGTGGTAAT TGAGGTAAAT CCAAGAGTGA GCCGTTCGTC TGCCTTAGCA TCAAAAGCAA CAGGATATCC TATTGCCCGA ATTGCTGCAA AAATAGCAAT TGGGCTTACA CTTGATGAAA TAATAAATCC TATCACCCAA AACACATATG CAAGCTTTGA ACCGTCTATA GACTATGTTG TTGTAAAAGT GCCCAGATGG CCGTTTGACA AGTTCGAAAA GGCAGACAGA CGACTTGGCA CACAAATGAA GTCAACTGGC GAGGTCATGG CAATTGGAAG AACATTTGAA GAAGCATTTT TAAAAGCAAT AGATTCACTG GATGTCAAGA TTAATTATCA GCTCGGTCTT AAGAAATTTG AAGAAATGCC AGATGATCAG CTTTTAGAGT ATATAAAAAC TCCAAACGAT GAGAGAGTTT TCGCAATATG CGAAGCTCTT TCTCGAAATT ATGACTGCAA GTTCATCTCA GACCTCAGCA AGATTGATTA CTTCTTTATT GAAAAGTTCA AAAACATAGT TGATATGTCA AAACAGCTCA AAAAATATGA CATTGAATCA CTGCCATATG ACCTTTTACA AAAAGCTAAA AGGCTTGGGT TTGGTGACTC ATACATTGCA AATCTTTTAA AAGAAGATGT GGACGAGGTA ATAGAAATAA GAGAGAAATG TAAGCTAAAA CCTTCTTTCA AGATGGTTGA CACCTGTGCA GGTGAGTTTG AAGCAAAAAC ACCATATTTT TATTCCACAT ACGAAAAAGA AACTGACTTA GTTGTAAGTT CAAAGCCCAA GGCAATTGTA ATAGGTTCAG GACCAATTCG AATTGGTCAG GGAATTGAAT TTGATTATTG CTGTGTTCAT TCAATATTCG CGTTAAAAGA AGAAGGAGTT GAAGCTATAA TCATCAATAA CAATCCTGAA ACAGTATCAA CTGACTTCGA TACATCAGAC AAGCTCTTTT TTGAACCTCT TACAAAAGAA TGTGTTTTAG ACATAATAAA ACAGGAAAAA CCAATGGGCG TAATAGTTCA ATTTGGTGGT CAGACTGCAA TAAATATGGC TTCATACCTT GCAAAAAACG GTGTGAAGAT TTTAGGAACT TCTATGGAAA GTATTGACAC TGCAGAGGAT AGAGACAAGT TTTTGAATCT TCTTAAAAAC CTCAATATTC CTTATCCTCC AGGTGGGGCT GCATACAGCT TGGAAGATGC GGTAAAGGTT GCTCAGCAAA TTGGCTATCC TGTTCTGGTA AGACCATCTT ATGTTCTTGG CGGAAGGGCA ATGGAGATTG TCTACAGCCG CGAGGAGCTT GAAAAATATA TCAAAGCTGC AATTGAGATA TCAATAAAGC ATCCTATTTT GATCGACAAA TACATCCTTG GAAAAGAAGC GGAAGTTGAT GGAATCTCGG ATGGAGAAGA TGTGTTAATA CCTGGAATCA TGGAACATAT TGAAAGAGCT GGTGTTCATT CTGGCGACAG CATGGCAGTA TTCCCACCCC ATACACTCTC TGAGAGGGTT AAAGAAAAAA TTGTTGATTA TACTATAAAA CTTGCCCGCG CCTTGAGAGT TGTAGGACTT TTCAATATTC AATTTGTAAT TGATAAAGAT GAAAATGTAT ATGTAATAGA AGTAAATCCT CGCGCAAGCA GAACTGTGCC AATTTTGAGC AAGGTTACGG GAATTCCTAT GATAAAGATT GCAACTAAAC TCATACTTGG CAAGAAGCTC AAAGATTTGG GATACCAAAC TGGCCTTGTA AAAGAACCAG ACTTTTTTGC AGTAAAAGCT CCTGTATTTT CGTTTTCCAA ATTATCAAAG GTTGATGCAT ATTTAGGTCC AGAGATGAAA TCCACTGGCG AAGTTCTTGG TATCTCCAAA AACCTAAAAG TTGCTCTTTA TAAAGCTTTT ATATCCTCCA ATCATAAATT TACAAAAAAC GGAAGCTGCT TGATTTTAGC ACCTGAAAGC GAAAGGGATG CTATACAGCA GATCATAAGA AAACTATATG AAGTAAACTT CAAAGTGTTC CTCTTAGATA GCATGAAGGA TTATATAAAA GGCTTAAATG TAGAGTTTAT AAACAAAGAG ACTGCTCAAA AGTTGCTACT TGAAGACAAA TTCTCGTTTG TAATAAATAT TCCATCCAAA GACAAAATGC AAGAGTTTGG TTTTGTTTTG CGAAGGCTTT CTGTTGAGTT TGGAATCACA ACATTGACAT CGATTGACAC AGCACTGTAT TATGTGGACG TTTTATCTTC ACTGGATGAG ATTGAAAAAG ATATCTATTG CTTGAATGAT TTATTTAAAG ATGAAAGGAT GAAGTGTTAT GAGACATTTT CTTCATCTAA ATGA
|
Protein sequence | MPKRKDIKKV LIIGSGPIVI GQAAEFDYSG TQACRALKEE GIEVVLVNSN PATIMTDTEI ADRVYIEPIS VDYIEEIIKK ERPQGLLAGL GGQTALNMAF ELAEAGILEK YGVCLLGTSL ETIKKAEDRE LFKKTMIEIG EPVPKSIIAH SVQEAIEFAR EVGYPVIVRP AYTLGGTGGG IAYNEEELRY IASKGLKLSL IHQVLIEQSV LGWKEIEYEV MRDSNDNCIT VCNMENIDPV GIHTGDSIVV APSQTLSDKE YQMLRSASLN IIRSLKIEGG CNVQFALNPN SMEYVVIEVN PRVSRSSALA SKATGYPIAR IAAKIAIGLT LDEIINPITQ NTYASFEPSI DYVVVKVPRW PFDKFEKADR RLGTQMKSTG EVMAIGRTFE EAFLKAIDSL DVKINYQLGL KKFEEMPDDQ LLEYIKTPND ERVFAICEAL SRNYDCKFIS DLSKIDYFFI EKFKNIVDMS KQLKKYDIES LPYDLLQKAK RLGFGDSYIA NLLKEDVDEV IEIREKCKLK PSFKMVDTCA GEFEAKTPYF YSTYEKETDL VVSSKPKAIV IGSGPIRIGQ GIEFDYCCVH SIFALKEEGV EAIIINNNPE TVSTDFDTSD KLFFEPLTKE CVLDIIKQEK PMGVIVQFGG QTAINMASYL AKNGVKILGT SMESIDTAED RDKFLNLLKN LNIPYPPGGA AYSLEDAVKV AQQIGYPVLV RPSYVLGGRA MEIVYSREEL EKYIKAAIEI SIKHPILIDK YILGKEAEVD GISDGEDVLI PGIMEHIERA GVHSGDSMAV FPPHTLSERV KEKIVDYTIK LARALRVVGL FNIQFVIDKD ENVYVIEVNP RASRTVPILS KVTGIPMIKI ATKLILGKKL KDLGYQTGLV KEPDFFAVKA PVFSFSKLSK VDAYLGPEMK STGEVLGISK NLKVALYKAF ISSNHKFTKN GSCLILAPES ERDAIQQIIR KLYEVNFKVF LLDSMKDYIK GLNVEFINKE TAQKLLLEDK FSFVINIPSK DKMQEFGFVL RRLSVEFGIT TLTSIDTALY YVDVLSSLDE IEKDIYCLND LFKDERMKCY ETFSSSK
|
| |