Gene Clim_0733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0733 
Symbol 
ID6356014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp801700 
End bp803148 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content52% 
IMG OID642668358 
Productprotein of unknown function DUF404 
Protein accessionYP_001942793 
Protein GI189346264 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAGCC TGTTTGAACG TTATGAAGCC TGTCCTGACA GGTTTTTCGA TGAGGTTCTC 
TTACCGGAAG GAAAACCGCG GGAACATTAC GACAAGATGA TTCACCGCTT CGGTCAGTTT
TCTACCGAAG ACATCAGAAC CCGCCGGCAG GTTGTCAATG TCTTTTTTCG CAATCAGGGA
ATCACCTTTA CGGTTTATGG CGTCGATGAG GGTATCGAAA GAATCTTTCC ATTCGATCTC
ATTCCAAGAA TCATTCCTTC TGATGAATGG CAGCGTATCG AGCGAGGACT CATTCAGCGG
ATCACAGCAC TCAACGAGTT CCTGGCAGAC ATCTATTCGA ATCAGAAAAT CCTGAGAGAC
AAGGTCGTTC CGGCAGAACT GGTGTTGGGA AGCAAACATT TCATAAGGGA GTTTATCGGG
GTCAAACCTC CGCTTGGCGT CTATATACAT GTTACGGGAA GCGACATCAT CCGTGACGCG
CAGGGCAGAT ATATGGTGCT GGAGGACAAT CTGCGAACAC CAAGCGGCGT CTCCTATATG
CTGCAGAACC GTCAGGCCCT GAAGCGGGCA TTTCCGGTAC TCTTCGATCG ATACAAGGTA
AGACCCATCG ACAACTACCC GCAGGAGCTG CTGCGCACCC TGCAGGAAAT CAGCCCGACG
TCGAGACCTG AGCCCAACGT CGTTGTACTT ACACCGGGCA TTTACAACTC CGCCTATTTC
GAGCACAGTT TTCTTGCCCG GCAGATGGGC GTCGAACTTA CCGAAGGTCG GGATCTCGTG
ATCAACAACA ACAAGGTTTA CACCCGCACC ACAAAGGGAC TGCAGCGTGT GGACGTCATC
TACCGCCGAG TTGACGACGA TTTTCTCGAT CCGCTGGTCT TCCGACCCGA CTCGAAGCTC
GGAGTGGCCG GCCTCATCAA CGCCTACCGC AAAGGAAACG TGGCGCTGGC CAACGCCATC
GGAACCGGAG TGGCTGACGA CAAGGTTATC TACAGTTTCG TACCGAAAAT GATCCGGTAC
TATCTCAACG AAGATCCGAT TCTTGATAAT GTCGACACCT GGCTCGCCAA CAATCCCAGG
GATCTTAAAT ATATTCTCGA AAACCTCGAC AAGCTTGTAG TCAAGTCCGC AAACGAATCA
GGGGGATATG GCATGCTTGT CGGGCCGGAA TCGACTCTGG AAGAACGCGA ACGGTTCGCC
GAAAAAATTG TCGCCGATCC ACGAAACTAT ATTGCACAGC CGACCATATC GCTTTCAAGA
CACCCAAGCT TTTTCAACGA TTGCGAACTC GCCGGCTGTC ACATCGATCT CAGGCCTTAC
GTGCTGTATG GCAAAACACC GACCATCGTA CCCGGCGGTT TAACCAGGGT TGCGCTCAAG
CGGGGCTCTC TTGTCGTCAA CTCATCTCAG GGAGGAGGAA GCAAGGATAC CTGGGTAATC
GATGAATAA
 
Protein sequence
MISLFERYEA CPDRFFDEVL LPEGKPREHY DKMIHRFGQF STEDIRTRRQ VVNVFFRNQG 
ITFTVYGVDE GIERIFPFDL IPRIIPSDEW QRIERGLIQR ITALNEFLAD IYSNQKILRD
KVVPAELVLG SKHFIREFIG VKPPLGVYIH VTGSDIIRDA QGRYMVLEDN LRTPSGVSYM
LQNRQALKRA FPVLFDRYKV RPIDNYPQEL LRTLQEISPT SRPEPNVVVL TPGIYNSAYF
EHSFLARQMG VELTEGRDLV INNNKVYTRT TKGLQRVDVI YRRVDDDFLD PLVFRPDSKL
GVAGLINAYR KGNVALANAI GTGVADDKVI YSFVPKMIRY YLNEDPILDN VDTWLANNPR
DLKYILENLD KLVVKSANES GGYGMLVGPE STLEERERFA EKIVADPRNY IAQPTISLSR
HPSFFNDCEL AGCHIDLRPY VLYGKTPTIV PGGLTRVALK RGSLVVNSSQ GGGSKDTWVI
DE