Gene Ccel_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3103 
Symbol 
ID7311700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3639025 
End bp3640215 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content41% 
IMG OID643610007 
Productsmall GTP-binding protein 
Protein accessionYP_002507375 
Protein GI220930466 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTCA CAAATACACC GGGTGCAAAC AGGCTGCACA TTGCACTATT CGGCAGAAGA 
AACAGCGGAA AATCTTCACT AATTAATACA ATAACAGGAC AGGACATTGC TCTGGTATCG
GAAATTGCCG GAACCACTAC AGATCCCGTA TACAAGGCTA TGGAACTGCA CCCTATAGGC
CCTGTTATGT TTATTGATAC GGCAGGCTTT GATGATGTGG GCACTTTAGG CGAACTAAGA
ATAGAGAAAA CACGAAAGGT AATTGATAAA ACGGATATAG CCATTGTAAT TTTCTCTGAA
ACCGAGCTTT CGATGGAAAA GGAATGGATG AATGAGCTAA AAAAACGTAA AATACCCGTT
ATTCCCATAA TAAATAAAGC GGATATACTT AATAATACAG ATGATATAAA AAAGCAGGTG
GAGGAAACTC TGGGCCTGAT GCCAATCATT ATCAGTGCAA AGGAAAAAAC AGGGCTTGAT
AAAGTCAGAG AAGAGCTAAT TAGAGCGGTG CCGGAGGATT TTGAGGTAAG CAGCATAACC
GCCCATCTTG TAAATGAAGG AGATTTTGTT TTGTTGGTCA TGCCTCAGGA CATTCAAGCT
CCAAAGGGAC GCCTGATTCT GCCTCAGGTG CAGGTAATCA GAGATTTACT GGATTTAAAA
TGTATTGTTA TGAGTGTTAC TACCGACAAG CTTGAAAATG CACTAAAGGC AATGTCAGGA
CCTCCCAAAT TAATAATTAC CGATTCACAG GTGTTCGACA AAGTATATGC TAAAAAACCT
GAAGAAAGCC TGTTGACATC ATTTTCCGTT CTGTTTGCAG AATATAAAGG TGATATTTCT
GCATACATTA AAGGTGCTGA AGCAATAGAT GCACTAACTG AGAATTCAGC CGTTCTGATA
GCGGAAGCCT GTACCCATGC ACCTCTAAGT GAGGATATCG GACGTGTGCA GCTGCCAAGG
CTTCTCAGGG AAAAGATAGG AAAAGGTCTA ACTGTTGACA TTGTAAGCGG GAGCGACTTT
CCAAAAGATT TGTCAAAATA TTCACTGGTC ATTCAGTGTG GCTGCTGCAT GTTTAACAGG
AAATATGTAC TCTCACGTAT AGAGTCTGCT AAGGCACAGA ATGTAAGAAT TTGTAATTAC
GGAATTGCAA TCGCGAAGCT AAGAGGCATA CTAGAAAAAG TTGCATTATA G
 
Protein sequence
MSLTNTPGAN RLHIALFGRR NSGKSSLINT ITGQDIALVS EIAGTTTDPV YKAMELHPIG 
PVMFIDTAGF DDVGTLGELR IEKTRKVIDK TDIAIVIFSE TELSMEKEWM NELKKRKIPV
IPIINKADIL NNTDDIKKQV EETLGLMPII ISAKEKTGLD KVREELIRAV PEDFEVSSIT
AHLVNEGDFV LLVMPQDIQA PKGRLILPQV QVIRDLLDLK CIVMSVTTDK LENALKAMSG
PPKLIITDSQ VFDKVYAKKP EESLLTSFSV LFAEYKGDIS AYIKGAEAID ALTENSAVLI
AEACTHAPLS EDIGRVQLPR LLREKIGKGL TVDIVSGSDF PKDLSKYSLV IQCGCCMFNR
KYVLSRIESA KAQNVRICNY GIAIAKLRGI LEKVAL