Gene Clim_0099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0099 
Symbol 
ID6355623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp105082 
End bp106356 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content57% 
IMG OID642667723 
Producthypothetical protein 
Protein accessionYP_001942184 
Protein GI189345655 
COG category[S] Function unknown 
COG ID[COG4198] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00564358 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGAAA TTCAGCCCTT CAGGGCAATA CGCTACAATC CCGAAACTGC GGGAGATGCC 
GCAAAACTGA TCTGCCCGCC TTACGACGTT ATATCCAGCG AGCTGCAGCA GCAGTTGAAC
GACTCTTCTC CGTTCAACGC CGTCCGGCTG GAACTGCCTG TCGAAGCCGA CCCCTATGCG
GCATCTGCCG GGCGTCTGCA CGATTGGCTG CAGAACGGCG AGCTGCTTCG CGATACGGAG
CCCGCGCTCT ATCCCTACAG CCAGACCTTT ACCGATCAGG CGGGAAATAC CCATAATCGT
TGCGGCTTTT TCGCGGCCAT GCGGCTGCAC GAGTTTGCAG AAAGGAAGGT GCTGCCGCAC
GAACGGACGC TTTCCGGCCC CAAGGCCGAC CGGCTGAACC TGTTCCGCAA AACGCGCACG
AACATCAGCT CGATTTTCGG CCTCTATGCC GATGATGAAG GAGAGGCCGA CCGCATCGTT
TCCGCCTATA CCGACGCCAA TGAACCGCTG GTCGATGCCG TGTTCCAGGG CGTGCGCAAC
CGGCTGTGGC GAATCGTCGA TCCGGAGCTC GTGGCCGGTA TCCAGCAGGC GCTGGTTCAG
CGGACGGTCT ATATCGCCGA CGGCCATCAC CGTTATGAAA CCGGTTTGAA CTATCGCAGG
GAGCGTGCGG AAGAGAATCC TGAACATTCC GGAAACGAAC CGTACAATTT TATCCTTGTT
TATCTGAACA ATATTTATGA TAAAGGACTG GTGGTTTTTC CCATACACCG TCTGGTGCAC
AGCCTGGAGG ATTTCGACGC TTCGCGGCTC AGGGAGCGGC TCGAAGAGCA TTTTGCCGTA
ACGGAGCTTG ACGGACGCGC TGAACTGAAG GCCTATCTCG AAGCCGCCTC ATCGAGTTAT
GCCTATGGAG TGGTGAGCGA AGGTTCCGTT CTGGGGATTA CGCTGAAAAG CGATCCGGCA
GGCATGCTCG ACAGCGGCAT CGCTCCGGCC CTGCAGCAGC TCGGCCTTGT TGTGCTGCAT
GAGGTTGTGC TGAACCGCCT GCTTGGCATC GGTCAGGAAG CCATGGCAAA GCAGACCAAT
CTGGTGTATG TGAAGGACGA CGGCGATGTT TTCGATGCGG TCGTATCCGG CAGGGTTCAG
GCCGGATTTG TCGTCAAGCC GGCCACGGTC GGGCAGGTTC TTGCCGTTTC CGAATCCGGT
GGCGTGATGC CGCAGAAATC GACGTTTTTT TACCCGAAAA TAATGACGGG ACTTGTTTTC
AATCCGCTCG ACTGA
 
Protein sequence
MPEIQPFRAI RYNPETAGDA AKLICPPYDV ISSELQQQLN DSSPFNAVRL ELPVEADPYA 
ASAGRLHDWL QNGELLRDTE PALYPYSQTF TDQAGNTHNR CGFFAAMRLH EFAERKVLPH
ERTLSGPKAD RLNLFRKTRT NISSIFGLYA DDEGEADRIV SAYTDANEPL VDAVFQGVRN
RLWRIVDPEL VAGIQQALVQ RTVYIADGHH RYETGLNYRR ERAEENPEHS GNEPYNFILV
YLNNIYDKGL VVFPIHRLVH SLEDFDASRL RERLEEHFAV TELDGRAELK AYLEAASSSY
AYGVVSEGSV LGITLKSDPA GMLDSGIAPA LQQLGLVVLH EVVLNRLLGI GQEAMAKQTN
LVYVKDDGDV FDAVVSGRVQ AGFVVKPATV GQVLAVSESG GVMPQKSTFF YPKIMTGLVF
NPLD