Gene Clim_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1096 
Symbol 
ID6355738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1198898 
End bp1200079 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content51% 
IMG OID642668713 
Producthypothetical protein 
Protein accessionYP_001943144 
Protein GI189346615 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000465883 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACCA TCCATCTCAA ACCAAAAGAA CAACGGCGAC TGCTTAAAGG ACACCTGTGG 
GTGTTCAGCA ATGAACTGCA GACCGTTCCC GGCGATATTG CCGCAGGCGA AACCGTCCGT
CTCGTCACCC ATGACGACCG GCTTATCGGC ACCGGATTCT ATAATCCGCA CTCGCTCATA
TCTTTCCGCC TCCTTTCCCG CAGCGGAGAA ATTCCTGACA AAGAGTTTTT CAAAAAGAAA
ATAGCCGAAG CACTCGCACT TCGCGAAAAA ATCTACAGGT TGGACGACAC CAATGCATGG
CGTCTTGTGC ATGGCGAATC GGACGGCCTG CCTGGACTCG TTATCGACCG CTTCAACCGG
GCGATTGTTC TGCAGGCTTT TTCAGCGGGA ATGGACCGGC ATCTTCCTCT TGTCTGTGAC
GTCATCGAAG AATTGCTGCA GCCTGAAGCG ATCATACTGC GTAACGAATC GGTACTGCGG
GAACTTGAAG GACTCCCGCT CTACAAGGAA ATTATCAAGG GCGAACGGTC AGCGACCCTG
CAGACCATTC ATGATGCAGG CATCAGCTAT GAAGTGGATC TTTACGAAGG GCAAAAAACC
GGATTTTTTC TCGATCAGCG TGAAAATCGC AAAATCATCA GATCTTTTTC AGAAGGAGCC
GATGTCCTGG ATGTATTTAC AAACGATGGC GGATTTGCTC TCAATGCGTT GCGCGGAGGA
GCCCGTTCGG CCATTATGGT AGATATTTCA GAAGAGACCC TGAAACGGGC CGAAAAAAAT
GCCGTATTGA ACGGATTTGA AAACTTCAGC CTTGTTGCCT CGGATGCGTT TGATATGCTG
GGTAAAATGG TGGAGGCAAA AGAGCTTTTC GATGTAGTGG TACTCGATCC GCCAAGCTTC
ACCAAAAGCC GAAAAAACCT GCCGACGGCA CTTAAGGCCT ATAAAAGACT CAACAAGCTC
GGGCTGCAGC TGATTAAACC CGGCGGCTTC CTTGCCACAG CCTCATGCTC GCACCATGTA
AACGAAGAGG ATTTTCTTGC TGCTATTCAT CAGGCCGCGC TTGCCGCCGG AAAACAGCTG
CGGATGATCT ATAAAAACGC CCAGCCACCC GATCATCCGG TACTGCTCTC TATGCCTGAA
ACCGGCTATC TGAAATTTGC CTGTTTCTAT GTGACCGGCT GA
 
Protein sequence
MQTIHLKPKE QRRLLKGHLW VFSNELQTVP GDIAAGETVR LVTHDDRLIG TGFYNPHSLI 
SFRLLSRSGE IPDKEFFKKK IAEALALREK IYRLDDTNAW RLVHGESDGL PGLVIDRFNR
AIVLQAFSAG MDRHLPLVCD VIEELLQPEA IILRNESVLR ELEGLPLYKE IIKGERSATL
QTIHDAGISY EVDLYEGQKT GFFLDQRENR KIIRSFSEGA DVLDVFTNDG GFALNALRGG
ARSAIMVDIS EETLKRAEKN AVLNGFENFS LVASDAFDML GKMVEAKELF DVVVLDPPSF
TKSRKNLPTA LKAYKRLNKL GLQLIKPGGF LATASCSHHV NEEDFLAAIH QAALAAGKQL
RMIYKNAQPP DHPVLLSMPE TGYLKFACFY VTG