Gene Clim_0548 details

Gene Information

Locus tag: Clim_0548
Symbol:
ID: 6354899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 619278
End bp: 620309
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 52%
IMG OID: 642668184
Product: hypothetical protein
Protein accession: YP_001942619
Protein GI: 189346090
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR00661] conserved hypothetical protein
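
The gene and protein lengths in this record can be cross-checked from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
START_BP = 619278
END_BP = 620309


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1032
print(protein_length(length_bp))  # 343
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.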


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCC TTTTCGGGGT CCAGGGTACG GGAAACGGAC ATATCAGCCG CAGCCGGGAG 
CTGGTAAGAA AGCTCAAGGA GAACGGCCAT GATGTTGCGG TGATTATCAG CGGAAGAAAA
GAAGAGGAAC TGAAGGAGAT CGGTATCTTC GAGCCGTATC GGGTTATGAA AGGACTTACG
CTGGTGACCT ACAAGGGCAG GATGAATTAC ATGGAAACCA TGTTTCAGCT CGATCTTGCA
CGTCTTATGA GTGATGTTCT GATGCTCGAT ACGTCGGGTA TAGACCTCAT TATTACCGAT
TTTGAACCGA TCACCTCGAT GGCGGCCCGG ATAAAGAATA TTCCCTGTAT GGGGTTCGGC
CACCAGTATG CGTTTCGTTA CAACATACCG TTTGCGCGCG GCAGCATTTT CGAAAAGTAC
ACGCTTCTGA ACTTCGCTCC GGCCAGATAT AACGCGGGAT TGCACTGGAG CCATTTCAAC
CAGCCGATCT TTCCTCCGGT TATTCCTGAA ATGCTGTATG TTTCACAAAA ACGTGAGGTT
GACAGCCGCA AGCTTCTCGT CTATCTTCCG TTTGAAGAGG TAGAGGATGT CGCTGCCTTT
GTAAGGCCTT TCGGAAATTA TCAGTTCTGC ATTTATGGCA AGGTGAAAGA AAATCTTGAC
GAAGGTCACC TGCATTTCAG GAGTTATTCG CGCGAGGGCT TTCTGAATGA TCTGACGGAG
TGTAACGGCG TGGTCTGCAA TGCAGGGTTC GAACTGCCGG GCGAAGCGCT GCATCTTGGC
AAGAAACTGC TTCTCCGCCC TCTTGACGGA CAGATCGAGC AGGAATCCAA CGCGCTTGCC
ATGGAGGAGC TGCAGTATGG CATGGCCATG CATTCGCTCG ACCCCGACCT TCTCGCCAGC
TGGCTTGAAC TGCCAGGGCG CGAGCCGCTG AACTACTCCC GTACGGTTGA TTTCATTGCC
GAATGGATCG GAAGCGGAGA CTGGGAGGGG CTTTCCCGAT ACACGGAAGC CGCTTGGAAG
GCAACATTCT GA
 
Protein sequence
MKILFGVQGT GNGHISRSRE LVRKLKENGH DVAVIISGRK EEELKEIGIF EPYRVMKGLT 
LVTYKGRMNY METMFQLDLA RLMSDVLMLD TSGIDLIITD FEPITSMAAR IKNIPCMGFG
HQYAFRYNIP FARGSIFEKY TLLNFAPARY NAGLHWSHFN QPIFPPVIPE MLYVSQKREV
DSRKLLVYLP FEEVEDVAAF VRPFGNYQFC IYGKVKENLD EGHLHFRSYS REGFLNDLTE
CNGVVCNAGF ELPGEALHLG KKLLLRPLDG QIEQESNALA MEELQYGMAM HSLDPDLLAS
WLELPGREPL NYSRTVDFIA EWIGSGDWEG LSRYTEAAWK ATF
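
The gene and protein sequences are printed in space-separated blocks of ten characters. A minimal Python sketch of how the gene sequence could be cleaned up, translated, and checked for GC content follows. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code. It does not handle alternative start codons or ambiguous bases. All function names are hypothetical helpers rather than part of an existing library.

```python
# Codon table in the conventional T, C, A, G ordering of the standard code.
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First two display blocks of the gene sequence in this record.
print(translate("ATGAAAATCC TTTTCGGGGT"))  # MKILFG
```

Passing the complete gene sequence to `translate` and `gc_content` is one way to compare the result against the listed protein sequence and the reported GC content.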