Gene Clim_1145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1145 
Symbol 
ID6353661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1241978 
End bp1243399 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content53% 
IMG OID642668762 
ProductEndonuclease/exonuclease/phosphatase 
Protein accessionYP_001943193 
Protein GI189346664 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0301783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGTTT ACTCGTCACT CAGGATCAGG GAAAACGATG CCGGACACAT CAAGGCCTGG 
AAAAAAAGAA CCGCAGAGAA GCTGCTTCTG CTGCGTGCGG CGCTCGATAC CCATATCGCC
GATGCTTCAG GGGAGGAGTC TGCCGAACGC GCCTGCGAAA GGAACGGACG GCAATGGCTG
AGACTTGCCA CATGGAATAT CCGCGAGTTC GATACGTTAA AATATGGTGG TCGTCTCAAG
GAGTCACTCT ATTTTATTGC TGAAATAATT TCGCATTTCG ATATCGTAGC GCTTCAGGAA
GTTCGTGAGG ACCTTGCCTG CCTGCAGTCG GTCGTACAGT TTCTCGGTCA GCATGAGTGG
GATTATATCG CAACAGACGT TACCGAGGGC TCTTCGGGAA ACCGGGAACG TATGGTGTTT
ATCTACCAGA AAAACCGGGT GCGTTTCACC AGCATAGCGG GCGAGGTGAT GCTTGACAAA
GGCGATCTGG TCACCGATTC TTCCGGCTTG TGCTTTCGCG ACGCTTCGGG GCTGAAAGTG
GAGTTTCCTG AAGGTGTTAC GCTTTTGCCT TCCGGCGATG TTCCTGTTAT AAAAAGAAAA
GGCAAGGTGC TGCTGGAGGA CGATCTGGTG ATTCCTCTTC CCGCCGGTAC CAGGATAGTT
TTGCCTGAAG GAAGTTCGCT TGTTCTGCCC GGCGGCACTC AGCTTCCTGT TGAAAACGGT
CAGGTCGCCC TGGATGCAGC TTCGCATCAG GCATGGTCGC CTCATGCGCT GGTCCGACCG
CCGTATGATC TTCTTTCGGG TATCGGCCTG CAGTTCGCAC GTTCACCTTT TCTTGTCACC
TTCCAGGCTG GCTGGCTGAA ATTCATTCTC TGTACCGTCC ATATCTACTA TGGAACGGGC
AAGGAAGGGC TGGCCAGGAG AAACGAGGAG ATCAGGAAAC TAACCCGTTT TCTTTCACGG
AGGGCCGAAA GCGAGCATGA TTCCGATGCA GAAAACTTCT TTTTCGTGCT CGGAGATTTC
AATATTGTGG GAAAGAAACA TGTGACCTGG GAGTCGCTGC ATTCGAACGG TTTCAGGGTT
CCCGAACAGC TTCAGAAGAT TCCTGCCGGC AGCAATGCGG CACGCGACAA GGCGTATGAC
CAGATCGCCT TCTGGCAACC GACGGCAGCG GGGCATCCCG GCACTACCTT CATCGATGTG
GGTAATGCCG GCATTTTCGA TTACTTCAAG TATGTGTTCC GCTGGGGGGA CGATGATCAC
GACGGAGAAG ACGAACGGTA CTATGCTGAA AAAACAAAAA CACACAAGCT TGCCTACAAG
GAGTGGAGAA CCTATCAGAT GTCGGATCAT CTGCCCATGT GGATAGAGTT GAGAACCGAT
TTCGGTACCG ACTACCTTTC GGCGGTTTCC GCCTCCGACT GA
 
Protein sequence
MPVYSSLRIR ENDAGHIKAW KKRTAEKLLL LRAALDTHIA DASGEESAER ACERNGRQWL 
RLATWNIREF DTLKYGGRLK ESLYFIAEII SHFDIVALQE VREDLACLQS VVQFLGQHEW
DYIATDVTEG SSGNRERMVF IYQKNRVRFT SIAGEVMLDK GDLVTDSSGL CFRDASGLKV
EFPEGVTLLP SGDVPVIKRK GKVLLEDDLV IPLPAGTRIV LPEGSSLVLP GGTQLPVENG
QVALDAASHQ AWSPHALVRP PYDLLSGIGL QFARSPFLVT FQAGWLKFIL CTVHIYYGTG
KEGLARRNEE IRKLTRFLSR RAESEHDSDA ENFFFVLGDF NIVGKKHVTW ESLHSNGFRV
PEQLQKIPAG SNAARDKAYD QIAFWQPTAA GHPGTTFIDV GNAGIFDYFK YVFRWGDDDH
DGEDERYYAE KTKTHKLAYK EWRTYQMSDH LPMWIELRTD FGTDYLSAVS ASD