Gene Clim_0334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0334 
Symbol 
ID6353853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp370967 
End bp371947 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content54% 
IMG OID642667964 
ProductKpsF/GutQ family protein 
Protein accessionYP_001942406 
Protein GI189345877 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA CAGATGCAAT GCGAAATCTG ACATCAAGCG GAAAAACCAT TCTCGAACAG 
GAAGCCGGCG CTCTCCGGCA GATAGCCGAA CGGCTTGACG ACACCTTTGC GAGCGCCGTA
ACGGCTATGC ACGCCTGCAG CGGAAAAATC ATCATTTCCG GCATGGGAAA ATCAGGAATT
ATCGCCCAGA AAATTGCAGC AACGATGGCA TCTACCGGAA CAACCGCCAT GTTTCTTCAT
CCGGCCGATG CGGCACACGG CGATCTCGGC ATTGTTTCCG AAGGTGACGT GGTCATCTGT
CTTTCGAAAA GCGGCACGAC CGAAGAGCTT AATTTCATTC TGCCGGCACT CAGGAGAATA
GGAGTCGCCA TTATCGCACT GACCGGCAAT CCCCGCTCAT ATCTCGCCCG GAATGCCGAT
ATCGTACTTG ACACGGGCAT CGATCAGGAA GCCTGCCCTT TTGACCTTGC TCCGACCTCA
TCGACCACGG CAATGCTTGC CATGGGCGAT GCACTTGCCA TCACCCTCAT GCAGGCAAAA
CAGTTCACCC CTCGCGACTT CGCCCTGACC CATCCAAAGG GAGCGCTCGG AAGACGCCTG
ACCATGAAAG CCTCCGACAT CATGGCATCC GGTGATGCGC TCCCCATCGT CGACGATCAA
GCGGTTCTCG GTGAACTCAT TCTTGAAATG ACCTCGAAAC GTTACGGAGT CAGCGCAATC
GTTGACCGAA AAGGAAAGCT CTCCGGCATT TTTACCGATG GCGACCTCCG AAGGATTGTC
CAGAAAGGCG GCAATTTTCT TCAGCTTTCC GCCCGATCCG TCATGACCGA AAACCCGAAA
AGCGTTCCCC CCGACACCCT TGCCAAAGAG TGCCTCGACA TACTCGAAAC ATTCAGAATA
ACCCAGCTTA TGGTTTGCGA CAACGATAAC CGACCCGTCG GGATTATTCA TATCCACGAC
CTGATAACGC TGGGATTGTA G
 
Protein sequence
MNKTDAMRNL TSSGKTILEQ EAGALRQIAE RLDDTFASAV TAMHACSGKI IISGMGKSGI 
IAQKIAATMA STGTTAMFLH PADAAHGDLG IVSEGDVVIC LSKSGTTEEL NFILPALRRI
GVAIIALTGN PRSYLARNAD IVLDTGIDQE ACPFDLAPTS STTAMLAMGD ALAITLMQAK
QFTPRDFALT HPKGALGRRL TMKASDIMAS GDALPIVDDQ AVLGELILEM TSKRYGVSAI
VDRKGKLSGI FTDGDLRRIV QKGGNFLQLS ARSVMTENPK SVPPDTLAKE CLDILETFRI
TQLMVCDNDN RPVGIIHIHD LITLGL