Gene Acel_0637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0637 
Symbol 
ID4485543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp691204 
End bp692658 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content70% 
IMG OID639729405 
Producthypothetical protein 
Protein accessionYP_872396 
Protein GI117927845 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.110504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0645726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGATA ACCTGCGGCT GCTTGCCATC CTGCTCGGCG TCGTCCTCAT CGCGGCCGGC 
ATCGGCTGGC GGACCGTCCG GTTCGCGCGG ATTCGGCAGG GCATCCGCAG CGGCCTTGCG
GACCCCGACC CTGCAGTGCG CATCGCCGCG GTGCGGCAGG CGGCCGAGCT CGGCTTGGCC
AGCACGGCTC CGGCGCTGCT CCGCGCCGTC CGCGTCGAGA CCGACCCGGC GGTCCGGGCG
GCGGTCGTCG AGTGTGTCGC TGCTCATCAA TGGGAACCGG CGTCCACTGC GGCGATCGTC
GAGTTGCGGT TGTGGGCGAA GGCGTACGTC GGCGCACGCC CGGATCTCGC GCGATCACCG
GTCCGGTCGG CTCCGCTGCT CGCCGGGGTG GCGGGGACCG TGCCGGCGCC GTCCCTGCAT
CCTGTCCGCG AACACGAATT CCGGCGCCGG ACCGCGACCG ACCTGCACGT CGACGACGCC
GCCGCGCTGG GGCCGCCGCC GGACGACGAT CCGCTGGCCC GACTGCGGAT TCTGGTCACC
GGTGTCGGCG GTGCAGCCGG CGTCGCCGTC GTCCGGGCAT TGCGGGCCGC CGGGCACACG
GTGATTGGCG TGGACGCCGA TCCGGACGCT GTGGGGATCC GGCTTGCCGA TCACGGCCAA
GCCGTGCCGC GAGCCGACGA TCCGACGTAC CTGACCGCGC TGATCCACGC TGCGACGCTG
TTCGACGCCG AGCTGCTCGT GCCGACCGTC ACCGAAGAAT TGCCCGTTCT CATTGCCGGT
GCAGGGTATT TGCGTGACGC CGGGTTGCGC TTTCTCTTTC CCGACGCCCG CACGGTACAG
ACCTGCAGGG ACAAGTGGGC GTTCTACCAG GCGATGCGCG ACGCTGGGGT TCCGGTGCCG
GCCACCGCTC TCGGCACCGC GGAGGGCGTT CCCGGACCGT GGATCGTCAA GCCCCGTTTC
GGTCGCGGTT CGCGTGATGT GCACCGGGTC ACGGCTCCCC AGGAGCTGGC CGCCGCATTG
ACGTTGGTGG CCGAGCCGAT CGTGCAGACC GCGCTGGCCG GCCGGGAATT CACCGCGGAC
GTTCTCGCGC ATCCGGCGGA AATTGTCGCG GGCGGCGCGT TGCGGTGGCG GCTTGCGACC
AAGGGCGGTA TTTCGACCGT GGGGGAGACG TTCTCCGACG ATTCCGTGAT GCAGACCGTT
GCGTTGACCA TCAAGGCGCT CGGCCACATC GGGCCGGCGA ATATTCAGGG GTTCGTCTCC
GACGACGGCG CGGTCACGGT CGTGGAGGCA AATCCCCGCT TCTCCGGGGC GCTTCCGCTC
TCCTTGGCTG CGGGTGCGGA CCTAGTCGGC GAATATGTCC GTGCCATCGT CGGGCGGGCC
GTGCGAACCG AGCGGCTAGC CGCCCGCCCC GGCGTCCGCA TGTACCGGTA TTTCGACGAG
GTGTACCAGA CGTGA
 
Protein sequence
MVDNLRLLAI LLGVVLIAAG IGWRTVRFAR IRQGIRSGLA DPDPAVRIAA VRQAAELGLA 
STAPALLRAV RVETDPAVRA AVVECVAAHQ WEPASTAAIV ELRLWAKAYV GARPDLARSP
VRSAPLLAGV AGTVPAPSLH PVREHEFRRR TATDLHVDDA AALGPPPDDD PLARLRILVT
GVGGAAGVAV VRALRAAGHT VIGVDADPDA VGIRLADHGQ AVPRADDPTY LTALIHAATL
FDAELLVPTV TEELPVLIAG AGYLRDAGLR FLFPDARTVQ TCRDKWAFYQ AMRDAGVPVP
ATALGTAEGV PGPWIVKPRF GRGSRDVHRV TAPQELAAAL TLVAEPIVQT ALAGREFTAD
VLAHPAEIVA GGALRWRLAT KGGISTVGET FSDDSVMQTV ALTIKALGHI GPANIQGFVS
DDGAVTVVEA NPRFSGALPL SLAAGADLVG EYVRAIVGRA VRTERLAARP GVRMYRYFDE
VYQT