Gene Clim_0834 details


Gene Information

Locus tag: Clim_0834
Symbol:
ID: 6353904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 915887
End bp: 917839
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 52%
IMG OID: 642668457
Product: alpha amylase catalytic region
Protein accession: YP_001942892
Protein GI: 189346363
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
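
The start, end, and length values above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the record itself. The function names are hypothetical, and it assumes inclusive 1-based coordinates and a gene length that includes exactly one stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(915887, 917839)
print(length)                           # 1953
print(expected_protein_length(length))  # 650
```

Both results agree with the Gene Length and Protein Length fields listed above.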


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGATA ATCCTGTATT TACAGAAAAA CGGCCTGTCG AACGCACTCT CAGGGAGATT 
GATTTTTCAG AACTTGTCAA AGGAAAACAG TTTTATCCTT CCCCGGCTTC GTGGGAGGAT
GAGGTGCTTT ATTTTCTTTT TCTCGACCGT TTTTCTGACG GCCTTGAATC CGGCGGTTTT
GCATCTCTTG ACGGAATGCC GGTTGAAGGA AACGATTCAG GCAGAACGAC GTTGCTTTTT
TCACCGCAGA CCGATGCCGG AACAGCTGAC CGGGATGCGT GGTTTGAAGC GGGAAGAAAC
TGGTGCGGCG GAACTATTGC AGGCATGAAG GATAAACTGG GTTATCTGAA ACGACTCGGA
ATTACGTCCG TATGGGTCAG TCCGGTGTTC AGGCAGGTTA CGGGGAGCGG CGATTATCAT
GGTTACGGAA TCCAGAATTT TCTCGATGTC GATCCGCATT TCGGAACAAG GGAAGAGCTG
AGGGATTTTG TTGCCGCTGC GCATCAGTCC GGCATCAGGG TTATTCTTGA TATTATCATC
AATCATGCGG GTGATGTGTT CGCTTATGAG GGCAATGCGC AGTATACCTA TCAGGATGGA
TGTGAATGGC CAGTGCAGGG ATACCGCCGG CACAGCGGTG ATCCGGGAAG CCTGCCGTTC
GGCAGGATGG ATTTCGAAAA TACCGATGGA GCTGTCTGGC CGGTGGAACT GCAGGACGAG
AGTACCTGGT CAAAGCATGG CGAAATCAGG AACTGGGACT GTTTTCCGGA ATTTCTCGAC
GGAGATTTCT GTACCCTCAA GGATGTCCAT CTCGGCGATG CCCCTAAAGA TCCTGCCCTG
GCCTGGGATC TGCAGCGTCG TATTCGCGAG TTCAGACCTT CGAATGCGCT CAGGCATCTT
ACGGAGATCT ATCAGTTCTG GATTGCCTAT GCCGATATTG ACGGGTATCG GCTTGATACG
GTCAAACATA TGGAGCCTGG AGCAGTACGG TATTTTGCAA CTGCCGTGCA TGAGTTTGCC
ATTTCGGCAG GCAAGGAAAA TTTCTATATC ATCGGCGAGA TTACCGGAGG ACGCTCCTAT
GCGGCATCTA TTCTCGACAG CACAGGTCTC GATGCCGCAC TGGGCATCAA CGACATTCCC
GATAAGCTGG AGTTCATGGT GAAAGGGTGG CGCAGTCCCG GCAATCCCGA TACCGATGAG
CAGGAAGGGT ATTTCGATCT TTTTCGCAAC AGTCTGCTCG ACAACAAGCA TACCCGGCAG
TGGTACGGCA AGCATATCGT CACCATGTTC GACGATCACG ACCAGGTTGG AGTCAGGCAC
AAGTTCCGTT TTGCGGGCGA TGATTTCCGA AGCGAACTGC TTCTGCCGGT TGTGCTTGGC
CTGAACCTCG CTTCAGCAGG GATTCCCTGT ATTTATTATG GAACCGAGCA GGCGTTCAAC
GGTGCCGATC ATCGTCGGGA TGACGACTCG TACAGCGACG TTTTTCTGCG TGAGTGCATG
TTCGGCGGAC CATTCGGTTC GAGGCAGAGT GTCGGCAGAC ATTTTTTCAA CGAATCGCAT
CCGGTTTACC GGTTTATCCG CGATGTGACC GCGTTACGTC ATGATCATAT CGAGTTGAGG
CGTGGGCGGC AGTACCTGCG TCAGGTTTCT GCTACGGGTT TCGATGGTGA TTTTTACTAT
CCGCAGCCGA TGAACGGTCA ATTGCACTGG ATTATCGCCT GGTCCCGCAT TTTTGCACAG
AGGGAGCTGC TTTGTGCCGT CAATACCGAT ACGGATAACG GGTTGACTGT TTTCGTCGTG
GTTGACAGTT CGATACATCC TCCCGGCTCC TCCATGCAGT GTCTTTATAC AACCGCAGAT
GATTTTCAGC ATCATGCCGT TACGGTGGAA GCGAGGCAGG GTTCTTCGAT TCGAATTACG
GTTCCGGCTG GAGGTTTTGT GGTGTACGGA TGA
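
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any analysis. The sketch below is one possible way to do that in Python and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first line of the sequence, so its GC fraction need not match the 52% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here as a small sample.
sample = clean_sequence(
    "ATGTCGGATA ATCCTGTATT TACAGAAAAA CGGCCTGTCG AACGCACTCT CAGGGAGATT"
)
print(len(sample))  # 60
print(f"{gc_fraction(sample):.0%}")
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field.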
 
Protein sequence
MSDNPVFTEK RPVERTLREI DFSELVKGKQ FYPSPASWED EVLYFLFLDR FSDGLESGGF 
ASLDGMPVEG NDSGRTTLLF SPQTDAGTAD RDAWFEAGRN WCGGTIAGMK DKLGYLKRLG
ITSVWVSPVF RQVTGSGDYH GYGIQNFLDV DPHFGTREEL RDFVAAAHQS GIRVILDIII
NHAGDVFAYE GNAQYTYQDG CEWPVQGYRR HSGDPGSLPF GRMDFENTDG AVWPVELQDE
STWSKHGEIR NWDCFPEFLD GDFCTLKDVH LGDAPKDPAL AWDLQRRIRE FRPSNALRHL
TEIYQFWIAY ADIDGYRLDT VKHMEPGAVR YFATAVHEFA ISAGKENFYI IGEITGGRSY
AASILDSTGL DAALGINDIP DKLEFMVKGW RSPGNPDTDE QEGYFDLFRN SLLDNKHTRQ
WYGKHIVTMF DDHDQVGVRH KFRFAGDDFR SELLLPVVLG LNLASAGIPC IYYGTEQAFN
GADHRRDDDS YSDVFLRECM FGGPFGSRQS VGRHFFNESH PVYRFIRDVT ALRHDHIELR
RGRQYLRQVS ATGFDGDFYY PQPMNGQLHW IIAWSRIFAQ RELLCAVNTD TDNGLTVFVV
VDSSIHPPGS SMQCLYTTAD DFQHHAVTVE ARQGSSIRIT VPAGGFVVYG
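
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a minimal Python illustration, not the method used to produce this record. It builds the codon table from the standard 64-letter amino acid string, which translation table 11 shares with the standard code for elongation codons. Table 11 differs in its permitted start codons, and this sketch does not handle alternative starts (the gene here begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence.
print(translate("ATGTCGGATA ATCCTGTATT TACAGAAAAA"))  # MSDNPVFTEK
```

The output matches the first block of the protein sequence. Translating the final bases `GTGTACGGATGA` in the same way yields `VYG` followed by the stop codon, which agrees with the end of the protein sequence.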