Gene Athe_1378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1378 
SymbolcarB 
ID7409121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1462432 
End bp1465659 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content37% 
IMG OID643715743 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_002573251 
Protein GI222529369 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCACTGA GAAAGGATAT AAAAAAGGTT TTGGTTATTG GTTCAGGACC GATTATAATT 
GGCCAGGCAG CTGAGTTTGA CTATTCAGGA AGCCAGGCTT GCAAGGCATT AAAAGAAGAA
GGCATTGAAG TTGTCCTTAT AAACTCAAAC CCAGCAACAA TCATGACAGA CAAAACAATG
GCAGACAGTA TCTATATTGA GCCTATAACT TGTGAGATAA TAGAAAAGAT AATTCAAAAA
GAAAAAGTTG ACTCTATATT GCCAACACTT GGAGGTCAAA CGGGCCTTAA CACTGCTGTT
GAGCTTTACA AAAGTGGGAT CCTTGACAAA TACAATGTCA AGGTAATTGG AACAAACATT
GAAGCTATTG AGTTCGCAGA AGATAGACAA CTTTTTAAAC AGCTGATGAT AAAAATTGGC
GAACCTGTTG TTCCAAGTGA GGTTGTAAAC TGCGTAGAAG ATGGTCTTGC GTTTGCAAAG
AAAATAGGCT TTCCTGTTAT AATAAGGCCG GCATATACCC TTGGGGGGAC TGGTGGTGGT
ATTGCAAACA ATGAAGATGA GTTTGTAGAG ATTGCAAGAA GAGGTCTTTC TTACAGTCCT
GTACACCAAA TACTTGTAGA GAAAAGTATA AAAGGGTGGA AAGAGATAGA GTATGAGGTT
ATGCGAGACT CAAACGGATG CCTGATTACC GTGTGTAATA TGGAAAATAT TGACCCGGTC
GGAATTCATA CAGGCGATAG CATTGTTGTT GCACCATCGC AGACACTTTC TGATAAAGAA
TATCAAATGC TAAGGTCATC TGCTCTCAAA ATCATAGATA CTCTGAAGAT TGAGGGAGGA
TGTAATGTCC AGTTTGCCCT AAATCCTGAT AGTTTTGAAT ATGCAGTGAT TGAAGTAAAT
CCAAGAGTCA GTCGCTCATC TGCTTTAGCT TCAAAAGCAA CAGGGTATCC AATTGCCAGA
ATTGCTGCAA AGATTGCGCT TGGCTATACA CTTGATGAAA TAGAAAATGC CATTACAAAG
ATGACATACG CAAGCTTTGA ACCGACTCTT GACTATGTTG TTCTGAAAAT TCCACGCTGG
CCGTTTGACA AGTTTACTTA TGCAAATAGA AAACTTGGTA CACAGATGAA AGCAACTGGA
GAGATCATGG CAATCGGAAG AACATTTGAA GAAAGCCTTT TAAAAGGTAT AAGGTCTCTT
GACATAGGGC TGGACTACTT AGACCTCCCA GAGCTAAAAA GTTTGGATAA CAAAACTCTT
TCAAAGCTTA TAATCGAGGC TGATGACAGA AGAATATTTG CACTTGCTGA AGCTATAAGA
AGAGGATATG AAGTAGAGTA CCTATACCAA ATCAGTAAAG TAGACAGATT TTTTCTCCAC
AAGATTAAAA ATATAATTGA GATGGAAGAA AGAATAAAAA AAGAGGAATT AAACAGCAGT
ATCCTGCTTG AAGCAAAGAA GATGGGGTTC AGTGACAAGA CGATTGCAAA TCTCAAAGGG
ATATCTGAAA ATAATGTAAG AAGTTTGAGA AAAAGCCTAA ACATAACATC GGTATATAAA
ATGGTTGATA CATGCGCAGC AGAATTTGAA GCAAAAACTC CATATTATTA TTCAACATAT
GAAAAGGAAA ATGATGTAGC AGTACTACAA ATATCAGATA CACAGAGAAA GATTGTTGTT
TTGGGCTCAG GTCCGATTAG AATTGGTCAG GGAATTGAGT TTGATTATAC TTCTGTACAC
AGTGTTTATG CACTTTCAAA ACTTGGCATA AAATCTGTGA TTATCAATAA CAATCCAGAA
ACTGTTAGTA CTGATTTTGA TACCTCAGAT ATGCTCTTTT TTGAACCTCT CACAAAAGAA
GATGTCTTAA ATGTTATTGA AACAGTAAAA CCAGAGGGTA TTATAGTCCA ATTTGGTGGT
CAGACAGCTA TAAAACTTTC TCAGCAGCTT GCAAAAGAGG GTATCAAGAT TTTTGGTACA
AGCGCAGAGG GAATAGATAT TGCCGAAGAC AGAGAAAGAT TTGACAAAAT CTTGAATAAG
CTTAACATCA AAAGACCTCC AGGCTTTACA TGTTATACAC TGCAAGAAGC ACTAAGAATA
GCAAACTCAT TAGGATATCC TGTGCTTGTG CGGCCTTCAT ATGTTCTTGG CGGACAGGGT
ATGAAAATTG CCTTTGACGA TAACGATATT GTAGAGATGC TCAGCTATGC GAAAAATCTC
AATGATCATC CTATCCTCAT TGACAAATAC ATCGTCGGAA AAGAAATAGA GGTTGATGCT
ATATCTGATG GCGAGGACAT CTTGATCCCA GGAATAATGG AACATATTGA AAGAGCAGGT
GTTCATTCGG GAGACAGTAT TTCCCTGTAT CCTGCAAGGA ATATTTCAAA GTATATTGAA
GAGAAGATTG TCGAATATAC GCTCAAAATA GCAAAAGAAC TTGAATGCAA AGGACTTATA
AATGTGCAGT TTATTGTTCA AAACGAAGAA CTTTATGTTA TAGAAGTAAA TCCGAGAGGC
AGTAGAACAG TACCCTTTTT GAGCAAGGTA ACTGGTGTTC CTATGGTTGA GCTTGCAACA
ATGGTAAGTT TGGGATACAG GCTGAAAGAT TTGGTAAACA AAGTTGGGCT TTTGCCGAAA
AAAGATTTCT ATGCCTTTAA AGTTCCTGTT TTCTCATTTG AAAAACTGCC TGATGTTGAA
GTATCACTTG GTCCTGAGAT GAAGTCAACA GGTGAGGTTA TGGGAATTTC GAAAGACTAC
TATGTTGCAC TTTACAAAGG GCTTGTTGCA AGCGGTACAA AGCTACCATT AGAAGGTGGA
GTACTTTTTA CTGTTGCCGA CCCTGACAAA AATGAAATCA TCCCAATTGC TGAGAAGTTT
GAAAAGCTTG GATTCAAGAT ATACGCAACA TCAAAGACAG CAAAACATCT GAATTTTTAT
CAGGTTGCTG CAAATTATGT TAAAAAAGTG TCTGAAGGAA GTCCAAATAT AATAGATCTG
ATTAGAAAAG GAGAAATAAA CATTGTTATC AACACTCCTA CAAAAGGAAG ACAGCCGCAA
AGAGATGGTT TTTTGATAAG AAGGTTTGCT GTGGAAAATA AGGTTCCAAT CTTCACTTCA
GTTGACACTG CAAAGGCTGT AGTTGAAATT ATTGAGTTTC TAAAACAAAA AAGAGAACTT
GACATATTTA ACATAGGAGA GATTGATAAT GAAGCTATTA GACGTTGA
 
Protein sequence
MPLRKDIKKV LVIGSGPIII GQAAEFDYSG SQACKALKEE GIEVVLINSN PATIMTDKTM 
ADSIYIEPIT CEIIEKIIQK EKVDSILPTL GGQTGLNTAV ELYKSGILDK YNVKVIGTNI
EAIEFAEDRQ LFKQLMIKIG EPVVPSEVVN CVEDGLAFAK KIGFPVIIRP AYTLGGTGGG
IANNEDEFVE IARRGLSYSP VHQILVEKSI KGWKEIEYEV MRDSNGCLIT VCNMENIDPV
GIHTGDSIVV APSQTLSDKE YQMLRSSALK IIDTLKIEGG CNVQFALNPD SFEYAVIEVN
PRVSRSSALA SKATGYPIAR IAAKIALGYT LDEIENAITK MTYASFEPTL DYVVLKIPRW
PFDKFTYANR KLGTQMKATG EIMAIGRTFE ESLLKGIRSL DIGLDYLDLP ELKSLDNKTL
SKLIIEADDR RIFALAEAIR RGYEVEYLYQ ISKVDRFFLH KIKNIIEMEE RIKKEELNSS
ILLEAKKMGF SDKTIANLKG ISENNVRSLR KSLNITSVYK MVDTCAAEFE AKTPYYYSTY
EKENDVAVLQ ISDTQRKIVV LGSGPIRIGQ GIEFDYTSVH SVYALSKLGI KSVIINNNPE
TVSTDFDTSD MLFFEPLTKE DVLNVIETVK PEGIIVQFGG QTAIKLSQQL AKEGIKIFGT
SAEGIDIAED RERFDKILNK LNIKRPPGFT CYTLQEALRI ANSLGYPVLV RPSYVLGGQG
MKIAFDDNDI VEMLSYAKNL NDHPILIDKY IVGKEIEVDA ISDGEDILIP GIMEHIERAG
VHSGDSISLY PARNISKYIE EKIVEYTLKI AKELECKGLI NVQFIVQNEE LYVIEVNPRG
SRTVPFLSKV TGVPMVELAT MVSLGYRLKD LVNKVGLLPK KDFYAFKVPV FSFEKLPDVE
VSLGPEMKST GEVMGISKDY YVALYKGLVA SGTKLPLEGG VLFTVADPDK NEIIPIAEKF
EKLGFKIYAT SKTAKHLNFY QVAANYVKKV SEGSPNIIDL IRKGEINIVI NTPTKGRQPQ
RDGFLIRRFA VENKVPIFTS VDTAKAVVEI IEFLKQKREL DIFNIGEIDN EAIRR