Gene Clim_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1474 
Symbol 
ID6354787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1579699 
End bp1580979 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content59% 
IMG OID642669081 
ProductHipA domain protein 
Protein accessionYP_001943509 
Protein GI189346980 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.738708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACCGTT CCGTAAACCT TATTGAAGTT CGCGCCTGGA ATCGCACCGT CGGAACCGTT 
ACGATGGAAC CCGGATCCGG CAGCTGCATC TTCGAATACG ACCCCTCGTG GCAGGACGGA
GGCATCGAGC TTGCGCCGCT GACCATGCCT CTCAGTCAGG CCGTCCACGC TTTTCCGCTG
CTGCCTGAAG CCACCTTCAT GCGGCTCCCC GGACTGCTTG CCGACTCCAT ACCCGAAGGG
TTCGGCAGCA GGCTGATCGA GACCTATCTG GTAAACGAGG GACTCTCACC TGAAGCGATA
ACGCCGCTGG ACCGGCTCGC ATACATGGGC AACAGCGGCA TGGGCGCGCT GGAGTTCCGC
CCTATGCGCG GCCCTCGCTT TTCGAAACCG AAAGAACTGG AAATTGATTA TCTGGTTGCC
GCATCGGACG CGGCGTTTGC AGCAAATATC CACAACGACC GTGAAACCGA GGCCGCATTG
ACGAACCTCT TTCAGGTCGG CGCATCCGCC GGAGGCAAAC AGCCGAAAGC CGTCATTGCA
TGGAACGAGG AGAGCGACGA AATCCGCTCG GGTCAACTGC CGGCCGGGCC GGGTTTCGAG
CTGTGGATAA TCAAGCTTGA TGGCGTAGCC ATGGATTGTG ATACGAGCTG TGAAAGCTCT
TTCGGACGGA TCGAATACGC CTATTCGATG ATGGCAAAAG CTGCCGGTAT TGCCATGACG
GAATGCGGCC TGCTCGAAGA GAACGGACGG GCGCACTTCA TGACTCGCCG CTTCGACCGC
CGGGATGGCG AAAAACTGCA TCTGCAGAGC CTCTGCGCCT TGCGGCATCT CGACTGCATT
GAAGGGGAAA CCCACGATTA CGACCGGTAC TTCGAAACCG TCAAAGCACT CGGTCTGCCG
GAACCGGCCA TGCAGGAGGC CTTCAGGCGC ATGGTTTTCA ACGTGCTTGC GGCAAACTGC
GACGATCACA CCCGAAGCCT CTCGTTCCTC ATGGATGCTG CCGGAACCTG GTCACTCTCC
CCCGCGTACG GTCTGACGCA CGCCTTCACC CCATACGGAG AGTGGAAGTT CAGGCACAGG
ATGTCCGTCA ACGGAAAGTT CCGCGATATT GCACGACAGG ATTTCGAAGC GGTTGGAAAG
CGCTTTTCAG TGCCCGACCA TGAAGGCATC GTCAGGGATG TGGCCGAAGC CGTCCGACGC
TGGCCGGAAT TCGCCGCCGC CGTGCGGTTG AATCCGGAAA CCCTGCTCCG CGTTCAGCAG
GATTTCCCGG ATATGGGATA A
 
Protein sequence
MYRSVNLIEV RAWNRTVGTV TMEPGSGSCI FEYDPSWQDG GIELAPLTMP LSQAVHAFPL 
LPEATFMRLP GLLADSIPEG FGSRLIETYL VNEGLSPEAI TPLDRLAYMG NSGMGALEFR
PMRGPRFSKP KELEIDYLVA ASDAAFAANI HNDRETEAAL TNLFQVGASA GGKQPKAVIA
WNEESDEIRS GQLPAGPGFE LWIIKLDGVA MDCDTSCESS FGRIEYAYSM MAKAAGIAMT
ECGLLEENGR AHFMTRRFDR RDGEKLHLQS LCALRHLDCI EGETHDYDRY FETVKALGLP
EPAMQEAFRR MVFNVLAANC DDHTRSLSFL MDAAGTWSLS PAYGLTHAFT PYGEWKFRHR
MSVNGKFRDI ARQDFEAVGK RFSVPDHEGI VRDVAEAVRR WPEFAAAVRL NPETLLRVQQ
DFPDMG