Gene Clim_2357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2357 
Symbol 
ID6355703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2586367 
End bp2587530 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content54% 
IMG OID642669949 
Producthypothetical protein 
Protein accessionYP_001944359 
Protein GI189347830 
COG category[S] Function unknown 
COG ID[COG3876] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAGAA CAGGACTGGA TATGCTGCTG CAAAACATCG ACAGGTACAG CCGAAGACGC 
ATCGGTCTGA TTGTCAATCA GACATCGGTT ACCTCTGATC TGCGCTATTC ATGGGATATA
CTCAGAAAAA AAGGGCTGAA CATCAAGCGG ATTTTCTCAC CTGAACACGG ACTGTTCGCC
ACCGAGCAGG ATCAGATCGC CGTCGGCGGC CAGCCGGAAT CGGCATGCGA ACTGGTAAGC
CTCTACGGCA GTTCCGCGGA AACCCTCATT CCCGACAGGA GCCTGCTTGA CGACCTCGAT
CTCATTCTTT TTGACATTCA GGATGTCGGC TCCCGCTACT ACACCTATGT CAACACACTG
GCACTCTTCA TGGAAGCCGC ATCCGGCAGG GATATGGAAA TCATGGTACT TGACCGCCCA
AATCCGCTCG GAGGGACAAT GGTCGAAGGC CCTCTGCTCG ATCAGGCATT CCGATCCTTT
GTCGGGATTT TCCCCGTCCC GGTACGTCAC GGCATGACGG CAGGAGAGCT CGCCCTTCTC
TACCGTGAAT GGAAGAAAAT CGATATCAAG CTCACGGTCA TGACCATGAA GGATTGGAGC
CGCAGCATGC TTTTCCCCGA AACCGGCCTG CCATGGATCC CCCCTTCACC GAACATGCCG
ACATTTGCAA CTGCTGAAGT TTATCCGGGC ATGTGCCTTT TTGAAGGACT CAACGTTTCG
GAAGGCCGAG GCTCGACAAC CCCTTTTCAG CTCTTCGGCG CGCCATTCAT CAACCCGTAC
GATCTTGCGG AAGCATGCAA GGGTGCCGGA TTCGAAGGCG CACTGCTCCG CCCGACCTGG
TTCAGACCGA CCTTTCATAA GTTCAGCGAA ACTGTTATAG GAGGCGTCTG GCTTCATGTC
GCCGACAACC GCCGCTTCCG TCCATTCGCC GCTGGCGTAG CCCTTACTGC CGCTCTGCAT
AAGCTCTATC CCTCCAGTCT GGAGTTCCTC AGCGGAGTAT ATGAATTCAA CGACACCATT
CCGGCGTTCG ACCTGCTTGC CGGAAACAGT GCCCTTCGCA GTTCGATACT CGACGGATGC
TATACCGAAG CCCTTATCGA TTCCTGGCAG CATGACGAAA CTGAATTTGC CCGAACAAAA
GAGCGTTACC ACCTTTACAA ATGA
 
Protein sequence
MVRTGLDMLL QNIDRYSRRR IGLIVNQTSV TSDLRYSWDI LRKKGLNIKR IFSPEHGLFA 
TEQDQIAVGG QPESACELVS LYGSSAETLI PDRSLLDDLD LILFDIQDVG SRYYTYVNTL
ALFMEAASGR DMEIMVLDRP NPLGGTMVEG PLLDQAFRSF VGIFPVPVRH GMTAGELALL
YREWKKIDIK LTVMTMKDWS RSMLFPETGL PWIPPSPNMP TFATAEVYPG MCLFEGLNVS
EGRGSTTPFQ LFGAPFINPY DLAEACKGAG FEGALLRPTW FRPTFHKFSE TVIGGVWLHV
ADNRRFRPFA AGVALTAALH KLYPSSLEFL SGVYEFNDTI PAFDLLAGNS ALRSSILDGC
YTEALIDSWQ HDETEFARTK ERYHLYK