Gene Clim_1024 details


Gene Information

Locus tag: Clim_1024
Symbol:
ID: 6353726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1127811
End bp: 1129004
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 57%
IMG OID: 642668647
Product: putative iron complex transport system substrate-binding protein
Protein accession: YP_001943078
Protein GI: 189346549
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component
TIGRFAM ID:
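
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1127811, 1129004)
print(length)                           # 1194
print(expected_protein_length(length))  # 397
```

Under those assumptions the results agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields of this record.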


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.264421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAGC AGATGAGATT CGTTTCATTC CGGATCATGC GGGCAGTACT GGTGACGGTA 
CTGCCCCTTC TGCTGCTGGC GGGGTGCCGG CAGGAGCGGG AACGGCCGGT CAGAAGGGCT
GCGAAGAGGG ACGTGGAGCA GGAAATTCCC CTGCGTTACG CCCGCCGTTT TACCATGAAA
AAGGTTGGTT CCTGTACCCT CATCGAGATC AGGAAGCCGA AGGGGGCCAG GCTCGGGGTG
TTCTATCGCT ATCTGCTCGT TCCCGAAGGA GAAACCGCGC CTTCCGGGTA TCCTGACGCC
CTGGTTGTGG CTACTCCTGT CAGGAAAGTA ACCTGCGGTC TGGGGCTGCA GGTAGCCATG
ATCGGGCAGC TCGACCGGAT TGAAAGCATA GCGGGGGTGG GCATGGGGAA GTGGACGGGA
AACCCTGAGA TCCGCCGGAA GATGGCTGCC GGAGAGGTGC TCGAGACGGG CATGTCCGCC
GATATGAACA TGGAGGCCAT GGTGAGCATC GACCCTGATA TCGCCTTCGT CTACTCGTCG
GGAAGCGATA CCGACATCCA TGACAAACTG CTTTCGATGG GCATCAGGCC GGGGCTGGTG
TGCATGCACC TCGAGGAGCA TCCTCTCGGC GTTCTGGAGT GGATACGGTT TTTCGGTGCG
TTTTATGGCA GGGAGAAGGA GGCGGAGGCC TGTTTCAGGA GCGCAGCGGA ACGTTATGAA
AAACTCGAAA CATCGGTGAA GGATTCTTTC AGCGTGTGTC CGACGGTTAT TGTCGGCCAC
GCCACCAGAG GCATCTGGAC CACGCATGGT TCGAGCGCAT GGTTCATCAG GTTCCTGCAC
GACGCAGGAG CGCGCTACAT ACTCGAAGAG AGCGGCGAAT ACGAAGAGAA TCCGGTCAGT
CTCGAACACG CCCTCAAGGT CGGCATCGAA GCCGAATACT GGGTCAATCC CCGGTACAAT
GCGAAAACCA TTACCGACCT GCTTGGCGAT GACAAGCGCT ATCAGTATTT CTCTTCGGTC
AAATTCGGCA AGGTGTTCAA CAACGATAAC CTCACCTTCG ACGACGGACG GACGCTGTTC
TGGGAGACGG GCATGATGGA ACCGGACGAA GTGCTCAGGG ATCTCGTCGC GATTTTTCAT
CCCGGGCTGG TTCCGGGGCA TCGAATGAAA TACTATCGCA GGATGATGCG CTGA
 
Protein sequence
MEKQMRFVSF RIMRAVLVTV LPLLLLAGCR QERERPVRRA AKRDVEQEIP LRYARRFTMK 
KVGSCTLIEI RKPKGARLGV FYRYLLVPEG ETAPSGYPDA LVVATPVRKV TCGLGLQVAM
IGQLDRIESI AGVGMGKWTG NPEIRRKMAA GEVLETGMSA DMNMEAMVSI DPDIAFVYSS
GSDTDIHDKL LSMGIRPGLV CMHLEEHPLG VLEWIRFFGA FYGREKEAEA CFRSAAERYE
KLETSVKDSF SVCPTVIVGH ATRGIWTTHG SSAWFIRFLH DAGARYILEE SGEYEENPVS
LEHALKVGIE AEYWVNPRYN AKTITDLLGD DKRYQYFSSV KFGKVFNNDN LTFDDGRTLF
WETGMMEPDE VLRDLVAIFH PGLVPGHRMK YYRRMMR