Gene Clim_0109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0109 
Symbol 
ID6356074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp119993 
End bp121300 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content59% 
IMG OID642667737 
ProductHipA domain protein 
Protein accessionYP_001942193 
Protein GI189345664 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTACAA CAGCAAGGGT AAACCTGTGG GGGCGCACGA TCGGAGCGGT ATCGCTCGAT 
AGCGACGCTG CGACTGCAAC CTTCGAGTAC GATCCGGCCT TCGCTCGGAG CGGCATCGAA
ATCGCCCCGC TGACCATGCC GCTCTCCGGT CAGCTCTACT CCTTTCCCTC GCTGCGTCCC
GAAACCTTCC ACGGGCTTCC GGGACTGTTG GCGGATTCGC TGCCGGACCG GTTCGGCAAT
ACGCTGATCG ATACCTGGCT GGCCCGTTCC GGTCGCACGT CCGGCTCCTT CAATGCCATC
GAGAGGCTCT GTTATACAGG GTCTCGGGGC ATGGGCGTTC TTGAATATGC TCCAGCCATA
CAATTGGGGG GCTCCGGCTC TGCACCGCTC GAAATCGAAC GGTTGGTCGA ATTGGCTTCG
GAGGTGTTGA CCCATCGCAA CGATCTGCAG GTCTGGTTCC TCGATGGGGG CAAGGAGCTT
GCGCTCGGGG AGATTCTCCG GGTCGGCACC TCCGCGGGCG GAGCAAGAGC CAAGGCGGTA
ATTGCCTGGA ACCCGGAAAC CGACGAAGTC CGTTCAGGCC AGGTGAAGGC CGGAAAAGGG
TTCGAGTACT GGCTGCTCAA GTTCGACGGA GTGAGCGGCA ACAAGGACAG GGAACAGGAA
GATCCAAAAG GGTACGGTGC AATCGAGCAC GCATACTACC GCATGGCGCT GGATGCGGGA
ATCACCATGA CGCCCTGCCG CCTGTTCGAG GAAAACGGTC GTCGCCATTT TATGACGAAG
CGCTTCGACC GGTTGGAGGA TGGAGGCAAA CTGCACATGC AGTCGCTCTG CGGCATGGCG
CATTACGACT TCAATCGGGC GGGAGCTTAC GGGTATGAAC AGGCGTTGCA GGTCATCAGG
CGCCTTGGTT TGCCGATGGC TTCCGTGGAG GAGCAGTTCC GGCGAATGGT GTTCAATATC
GTGGCCCGCA ACCAGGATGA CCATGTGAAG AACATTGCCT TTCTGATGGA CAGGTCGGGC
AACTGGTCGC TTGCGCCAGC GTTCGACATG ACCTGGAGCT ATCAACCGGG GGGAGCGTGG
ACATCGACCC ATCAGATGAC GATGAACGGC AAACGGAGCG GATTCCTGCC GGACGACTTC
AGGGCATGTG CGAAAAGCGC ATCCATGAAA CGCGGGCGAG CCGAAACCAT CGTCGCTGAA
GTGCAGGACG TTGTTCGCAG ATGGCATGAT TATGCCGAGG AGTCGCGCAT CACTCCCCGA
CAACGGGATA CGATTGCAAC AACGCTGAGA CTGGAGGGCT TTGTATGA
 
Protein sequence
MSTTARVNLW GRTIGAVSLD SDAATATFEY DPAFARSGIE IAPLTMPLSG QLYSFPSLRP 
ETFHGLPGLL ADSLPDRFGN TLIDTWLARS GRTSGSFNAI ERLCYTGSRG MGVLEYAPAI
QLGGSGSAPL EIERLVELAS EVLTHRNDLQ VWFLDGGKEL ALGEILRVGT SAGGARAKAV
IAWNPETDEV RSGQVKAGKG FEYWLLKFDG VSGNKDREQE DPKGYGAIEH AYYRMALDAG
ITMTPCRLFE ENGRRHFMTK RFDRLEDGGK LHMQSLCGMA HYDFNRAGAY GYEQALQVIR
RLGLPMASVE EQFRRMVFNI VARNQDDHVK NIAFLMDRSG NWSLAPAFDM TWSYQPGGAW
TSTHQMTMNG KRSGFLPDDF RACAKSASMK RGRAETIVAE VQDVVRRWHD YAEESRITPR
QRDTIATTLR LEGFV