Gene Clim_1408 details


Gene Information

Locus tag: Clim_1408
Symbol:
ID: 6356179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1513173
End bp: 1514510
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 50%
IMG OID: 642669019
Product: nucleotide sugar dehydrogenase
Protein accession: YP_001943447
Protein GI: 189346918
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
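
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1513173
end_bp = 1514510
gene_length_bp = 1338
protein_length_aa = 445

# Assumption: coordinates are inclusive, so both end positions belong to the gene.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
coding_codons = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.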


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0334536
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAA CAATATTTGG GTCCGGGTAC GTCGGCCTTG TCACTGGAGC ATGTTTTGCC 
GAGGTCGGCA ACGAAGTGTT GTGTGTGGAT ATTGATCAGG CAAAAATCGA CAGGCTCAAT
AACGGAGAAA TTCCTATTTA TGAACCTGGT CTTGATGCCA TTGTGCATGA GAACAGCCGG
AAGGGCCGTT TGCGGTTTAC TTCGAATATT CCTGAAGGCG TCGAGTTCGG TCTCTATCAG
TTTATTGCCG TAGGCACTCC GCCCGATGAA GACGGTTCAG CCGATTTGCG CCATGTACTC
AGTGTTGCGG AAAGTATTGG CGCCCACATG CAGGATTACC GCATTATCAT CAATAAATCG
ACAGTCCCTG TCGGTACTGC GGATCTGGTT CGTGAAAAGG TACTTTCAGT ATTGGATGCA
AGGAATGCCG GCATCGATTT CGATGTGGTG TCAAATCCGG AGTTCCTCAA GGAGGGAGAT
GCGGTCAACG ATTTCATGAA ACCGGAACGG ATCGTGGTCG GCGTCGATAA TCCCCGAACC
AAAGAGCTGC TTCGTTTTCT TTATTCGCCA TTCAACCGCA GCCACGAGCG TTTTATCGCC
ATGGATGTCC GTTCGGCCGA GTTGACCAAA TACGCAGCAA ATTCGATGCT TGCGACGAAG
ATCAGCTTCA TGAACGAGAT TGCCAACATT GCCGAACTTG TGGGAGCAGA TGTCGAGGAG
GTTCGCAGAG GCATAGGATC GGATTCCCGC ATCGGCTTTT CTTTTATTTA TCCCGGTGTA
GGTTACGGGG GCTCCTGTTT TCCGAAGGAT GTTCAGGCTC TCGAACGTAC CGCCCGAAAA
CACGGATATG ATTCCCGGAT TCTTCAGGCT GTCGAGGCAG TCAATCACGA TCAGAAAAAC
AGTCTGGTAC GCAAGATGAA GGAGCATTTC AATGGAGATC TCAAGGGTAA GGTTATCGCG
CTCTGGGGTC TTGCGTTCAA GCCCAATACC GATGATATGC GTGAAGCTCC CAGCCGCAGG
GTGATTGAAG AACTTTGGAA GGAAGGTGCA CTGGTAAGGG TTTACGATCC GGTAGCCATG
GAAGAAGCTC AAAGGATTTA TGGCGAAAAA GAAGGCTTGC ACTATGCCGA AAGTCCGGAT
GAAGCAGTCT CCGGAGCTGA TGCACTTGCA ATTCTGACCG AATGGCTGAT GTTCCGCAGT
CCGGATTTCG ATATGATAAA ACGGGAACTC AAGGAGCCGG TGATTTTCGA CGGGCGGAAC
ATCTATAGTC CTGATTTTAT GGAGCAGTTC GGTTTTACCT ACTACTCGAT CGGCAGACGA
CCGAGAGGTA TCAGCTGA
 
Protein sequence
MKITIFGSGY VGLVTGACFA EVGNEVLCVD IDQAKIDRLN NGEIPIYEPG LDAIVHENSR 
KGRLRFTSNI PEGVEFGLYQ FIAVGTPPDE DGSADLRHVL SVAESIGAHM QDYRIIINKS
TVPVGTADLV REKVLSVLDA RNAGIDFDVV SNPEFLKEGD AVNDFMKPER IVVGVDNPRT
KELLRFLYSP FNRSHERFIA MDVRSAELTK YAANSMLATK ISFMNEIANI AELVGADVEE
VRRGIGSDSR IGFSFIYPGV GYGGSCFPKD VQALERTARK HGYDSRILQA VEAVNHDQKN
SLVRKMKEHF NGDLKGKVIA LWGLAFKPNT DDMREAPSRR VIEELWKEGA LVRVYDPVAM
EEAQRIYGEK EGLHYAESPD EAVSGADALA ILTEWLMFRS PDFDMIKREL KEPVIFDGRN
IYSPDFMEQF GFTYYSIGRR PRGIS
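
The relationship between the gene sequence and the protein sequence above can be illustrated by translating codon by codon. The following is a minimal sketch, not the database's own tooling. It assumes the sequence is read in frame from its first base, and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ in which codons may act as start codons). The helper `translate` and the constant names are hypothetical. Only the first and last rows of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 codon assignments equal those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record above.
# Each full row holds 60 bases (20 codons), so every row starts in frame.
first_row = "ATGAAAATAA CAATATTTGG GTCCGGGTAC GTCGGCCTTG TCACTGGAGC ATGTTTTGCC"
last_row = "CCGAGAGGTA TCAGCTGA"

print(translate(first_row))  # MKITIFGSGYVGLVTGACFA
print(translate(last_row))   # PRGIS*
```

The first row yields the first 20 residues of the protein sequence, and the last row yields its final five residues followed by the stop codon TGA.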