Gene Clim_0537 details

Gene Information

Locus tag: Clim_0537
Symbol:
ID: 6354888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 604962
End bp: 605978
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 52%
IMG OID: 642668173
Product: pentapeptide repeat protein
Protein accession: YP_001942608
Protein GI: 189346079
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
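
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 604962
end_bp = 605978
reported_gene_length = 1017      # bp
reported_protein_length = 338    # aa

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is
# assumed to be the stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

consistent = (
    gene_length == reported_gene_length
    and gene_length % 3 == 0
    and protein_length == reported_protein_length
)
print(gene_length, protein_length, consistent)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.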


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGATC AGGAACATCT GACCGTTCTT CGGCAAGGAG TTGCATCATG GAACAGGTGG 
CGCCTTGAAA ACAGCGGTAT TCAGCCCGAT CTGAGCGGTG CCGACCTGCG GGGCCGCGAA
CTTCAGGATG CGGATTTCAG CGGTACCGAC CTTCGCGCCG CCGATCTCAC CGGAGCCGAT
CTTCGTGGTG CAAGGCTCAG CAAGAGCACT ATCGATATTC ATACCCGTTA CGATACTATC
CGGGGTTGTG ATATCGGAGT GAACGGATTC TATTCTCCGG CTACCGATTC CGCAGCCCTC
ATGCGTCTCG ATCCTCCGGG AAACTCCATG CAGGGGTCCA ATGCGGAGGC TGTGATCGAA
AGTCTCAAGC ATGCCAGAAA ACTGCATACC TTTTCCATGA TTCTGGCCGG TATCGGTCTT
TTGTTTATCG TCATCAGGCC TAAATCCATT TCCCTTCCAT ACCTTGCCGG ATCGTTCAAG
TTCGACGATC TCAGCTACGC TTTTCTTGCT GCGCTGCTCT CCACCTCTCT GCTCAGTCTT
GTCGCGACCT TTATCGATTC CGCACTGCAG GGGGCGCACT ATCTCAACGA CCGCCGTTCA
GCCATGACGG TAGGTCACTT TCCCTGGTTG CTTTCCAAAT ATGAACAGGA GGGGGCATTC
AGACGCCAGT CTAAAGTCAT GCGTTTTTTT CTCAGTTTTC ATCCGCTGGT TTACCTGTAC
TTTTTTGTCA AATGGGATGC CCTTTTTCTT GGCGACTGGT ACGGAGTGAT AAGGCACTAT
CAGGAACTTC CGGTTATTCT CGGGGAGTGG CTTCTTCCGG TTTTTCTGGT CATTCTTGTA
CGGCTCTGTA TGAAAATTTT CAGACTCTCG GAAGGATTTC AAAAACCTAT TCTTTTCGAT
ACGGTAACGG AGAGAGAACG GCGTACCGAT ATGGAGAGGC TTGCCCAGGC AGTCGAAAAA
CAGGCTGTGG AAATCTCGGC ATTGACCGCA CTGCTGCGCC GGGAAAAAGA ACGGTAG
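
The gene sequence is printed in groups of 10 bases, so the grouping has to be removed before any computation. The following is a minimal sketch of how the reported GC content could be recomputed; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is pasted in as sample input.

```python
def clean_sequence(block: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGGCAGATC AGGAACATCT GACCGTTCTT CGGCAAGGAG TTGCATCATG GAACAGGTGG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_fraction(sample), 2))
```

To compare against the GC content field, pass the whole gene sequence block to `clean_sequence` instead of the single sample line.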
 
Protein sequence
MADQEHLTVL RQGVASWNRW RLENSGIQPD LSGADLRGRE LQDADFSGTD LRAADLTGAD 
LRGARLSKST IDIHTRYDTI RGCDIGVNGF YSPATDSAAL MRLDPPGNSM QGSNAEAVIE
SLKHARKLHT FSMILAGIGL LFIVIRPKSI SLPYLAGSFK FDDLSYAFLA ALLSTSLLSL
VATFIDSALQ GAHYLNDRRS AMTVGHFPWL LSKYEQEGAF RRQSKVMRFF LSFHPLVYLY
FFVKWDALFL GDWYGVIRHY QELPVILGEW LLPVFLVILV RLCMKIFRLS EGFQKPILFD
TVTERERRTD MERLAQAVEK QAVEISALTA LLRREKER
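
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code for all non-start codons, and translates only the first display line of the gene. The function name `translate` is illustrative, and the gene is assumed to be given in its coding orientation, as printed.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 assigns the same
# amino acids and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene and first 20 residues of the protein,
# both copied from the record.
gene_start = "ATGGCAGATC AGGAACATCT GACCGTTCTT CGGCAAGGAG TTGCATCATG GAACAGGTGG"
protein_start = "MADQEHLTVL RQGVASWNRW".replace(" ", "")

print(translate(gene_start) == protein_start)
```

Feeding the full gene sequence to `translate` and comparing the result with the full protein sequence extends the same check to the whole record.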