Gene Cthe_0179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0179 
Symbol 
ID4808667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp215545 
End bp216762 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content42% 
IMG OID640105590 
Productargininosuccinate synthase 
Protein accessionYP_001036613 
Protein GI125972703 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAAA AAGAAAAAGT AATCCTTGCC TATTCCGGCG GTCTGGATAC TTCAATAATC 
ATTCCCTGGC TTAAAGAAAA TTATGACTAT GAAGTCATTG CAATGGCAGC TGACGTTGGA
CAGGGCGAAG AGCTTGAACC CTTAAGAGAG AAAGCTATAA AAACCGGGGC AAGCAAAATA
TATATTGAAG ACTTGAAAGA AGAATTTGTG ACAGACTTTA TTTTCCCCAC ATTAAAAGCA
GGTGCCGTAT ATGAAGGAAA ATATCTTTTG GGTACATCCT TTGCAAGACC TCTTATAGCA
AAGAGAATGG TTGAAATTGC CTTAAAAGAA GGCGCTACAG CAGTCGCTCA CGGAGCAACG
GGAAAAGGAA ACGACCAGGT CCGTTTCGAG CTGACCGTCA AAGCTCTTGC TCCCCATCTT
AAAATCATTG CTCCGTGGAG AATATGGGAC ATTAAATCCA GAGAAGATGA AATCGAATAC
GCCCAGGCAA GAAACATCCC CATTCCTGTA TCAAAAGAAG ACAACTACAG CATGGACAGA
AACCTCTGGC ACCTCAGCCA TGAAGGCTTG GATTTGGAAG ACCCATGGAA CGAACCGCAG
TATGACAAAA TATTAAAGCT CATGGTTCCG CCGGAAAAAG CTCCGGACAA ACCCACCTAT
GTTGAAATAT ATTTTGAAAA GGGTATACCT AAAAAAGTTA ACGGTGTGGA ATACGGACCT
GTGGAGCTTA TTGAAGTTCT TAACAAAATC GGCGGTGAAA ACGGTATAGG AATCGTTGAC
ATAGTGGAAA ACAGATTGGT CGGAATGAAA TCCAGAGGCG TTTATGAAAC TCCGGGTGGA
ACCATTCTTT ATGCGGCTCA CAGAGAGCTT GAGCTTTTGT GTCTTGACCG TGACACTCTC
CATTACAAAG ATCTTGTGGC TCAAAGATTT GCAGAGCTTG TTTATTACGG ACAGTGGTAT
ACTCCTCTGC GTGAGGCTAT TTCAGCCTTT GTTGATGTTA CTCAGGAGAC GGTTACCGGT
ACAGTAAGAT TAAAACTTTA CAAAGGAAAT ATAATAAGCG CGGGAGCCAA ATCCGATTAT
TCCCTCTACA GTGAAGAACT TTCCACGTTC GGAGAAGACA ATGTATACAA TCAAAAGGAT
GCCGAAGGAT TTATAAATCT CTTTGGTCTT CCGATGAAGG TTCAGGCTCT TATGAAAGAA
AAGAATAAAG GCAAATAA
 
Protein sequence
MSQKEKVILA YSGGLDTSII IPWLKENYDY EVIAMAADVG QGEELEPLRE KAIKTGASKI 
YIEDLKEEFV TDFIFPTLKA GAVYEGKYLL GTSFARPLIA KRMVEIALKE GATAVAHGAT
GKGNDQVRFE LTVKALAPHL KIIAPWRIWD IKSREDEIEY AQARNIPIPV SKEDNYSMDR
NLWHLSHEGL DLEDPWNEPQ YDKILKLMVP PEKAPDKPTY VEIYFEKGIP KKVNGVEYGP
VELIEVLNKI GGENGIGIVD IVENRLVGMK SRGVYETPGG TILYAAHREL ELLCLDRDTL
HYKDLVAQRF AELVYYGQWY TPLREAISAF VDVTQETVTG TVRLKLYKGN IISAGAKSDY
SLYSEELSTF GEDNVYNQKD AEGFINLFGL PMKVQALMKE KNKGK