Gene Clim_2240 details


Gene Information

Locus tag: Clim_2240
Symbol:
ID: 6355263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2467204
End bp: 2468274
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 55%
IMG OID: 642669832
Product: GTPase EngC
Protein accession: YP_001944243
Protein GI: 189347714
COG category: [R] General function prediction only
COG ID: [COG1162] Predicted GTPases
TIGRFAM ID: [TIGR00157] ribosome small subunit-dependent GTPase A
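
The start, end, gene length and protein length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 2467204
end_bp = 2468274

# An inclusive interval has (end - start + 1) positions.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)       # 1071
print(gene_length % 3)   # 0, the length is a whole number of codons
print(protein_length)    # 356
```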


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAAAT TGAGCGATCT TGGTTTTGAC CCCTGGTTTG AAACACATGC AGATGCAATC 
CGTTCCGAGG GTCAGAGCGT GGCACGTGTT TCGGCGGTTG ACCGGAATTC CTGCATGATC
AGAAATGAAC AGGGGGAGAT TCCTGCCGAA CTTTCGGGGA AGTTTCTGTT CAATGTCGAG
TCGCCGGCAG ATCTGCCATG CGTCGGGGAC TGGGTTGCCG TGCAGTATCA CAATGACGGC
GCCCTTGCAA TCATTCATGG GCTCTTTCCC CGGAGGACGT TTTTACGCCG GAAGCGAGCC
GGCACGGAGG TGGACTACCA GATGATAGCC GCAAATATCG ATATCGCCTT TGTCGTGCAG
TCATGCCACT TCGACTTCAA TCTGGCGAGG CTGAACCGGT ATCTGGTGAT GGCGGCTGAC
GGTCATGTCG AGTCGATTGT CGTGCTTGCC AAAACGGACC TGATCTCCGG TGAAGAGCTT
CAGGAGAAGC TTGCGGCTAT CAGAGAGGCG GGCATTTCGG CCAGGGTGAT TGCGCTCAGT
AACCTGAACG GTTCCGGATT TGAAGAATTC CGTCAGCTGC TGCTGCCGCG AGGAACCTAT
TGTCTGCTTG GTTCCTCCGG AGTCGGCAAG ACGACGCTGA TCAATCATCT GATCGGACGG
GATGATTTCG ATACGAAAGC GGTCAGCGGA ACAGGAGAGG GCACGCACAC GACGACGCGT
CGGCAACTCA TTGTGCTTGA TGAAGGCAGT ATGTTCATCG ATACGCCGGG AATGAGAGAG
TTAGGCCTTT TGGGCGCCAG TGAAGGGGTA AACAAAGGGT TTGAAGATAT CACTGGGCTT
TCCAGAGCCT GCCGGTATGC CGATTGCAGC CATACCGGGG AGTCGGGTTG TGCAGTGCTT
GCTGCAATCG AAGCCGGAGA GCTGAGCGAA GAGCGCTATG CCGGTTATCT GAAACTCAGG
AAGGAGTCGG AGTACCACGA GCTGTCGTAC CTTGACAAAC GAAAAAAGGA GCGGGCATTC
GGTCGCTTTA TCAGGACGGC CAAGAAGGAT ATGAAAAGAT GGGATGGTTA G
 
Protein sequence
MTKLSDLGFD PWFETHADAI RSEGQSVARV SAVDRNSCMI RNEQGEIPAE LSGKFLFNVE 
SPADLPCVGD WVAVQYHNDG ALAIIHGLFP RRTFLRRKRA GTEVDYQMIA ANIDIAFVVQ
SCHFDFNLAR LNRYLVMAAD GHVESIVVLA KTDLISGEEL QEKLAAIREA GISARVIALS
NLNGSGFEEF RQLLLPRGTY CLLGSSGVGK TTLINHLIGR DDFDTKAVSG TGEGTHTTTR
RQLIVLDEGS MFIDTPGMRE LGLLGASEGV NKGFEDITGL SRACRYADCS HTGESGCAVL
AAIEAGELSE ERYAGYLKLR KESEYHELSY LDKRKKERAF GRFIRTAKKD MKRWDG
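
The protein sequence can be recovered from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first three blocks and the last row of the gene sequence are translated.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last row of the gene sequence, copied from the record.
first_blocks = "ATGACGAAAT TGAGCGATCT TGGTTTTGAC"
last_row = "GGTCGCTTTA TCAGGACGGC CAAGAAGGAT ATGAAAAGAT GGGATGGTTA G"

print(translate(first_blocks))  # MTKLSDLGFD
print(translate(last_row))      # GRFIRTAKKDMKRWDG*
```

The last row starts at base 1021, which is in frame (1020 is divisible by 3), so it can be translated on its own. The trailing `*` corresponds to the TAG stop codon, which is why the protein is one residue shorter than the codon count.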