Gene Clim_1898 details


Gene Information

Locus tag: Clim_1898
Symbol:
ID: 6354952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2101157
End bp: 2102251
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 43%
IMG OID: 642669496
Product: hypothetical protein
Protein accession: YP_001943910
Protein GI: 189347381
COG category:
COG ID:
TIGRFAM ID:
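
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2101157
end_bp = 2102251

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1095
print(protein_length)  # 364
```

Under these assumptions the result agrees with the listed Gene Length (1095 bp) and Protein Length (364 aa).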


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.521574
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a


Sequence

Gene sequence
ATGGGTAAAT TATTCGAATT AAAGAAATAC TACAAACATG AAGGACTGAT TCAGTCTCTC 
ATTCATTTGT ACTGGCTGGT TTCTGCCAAG ATAAGTTATA AGCTTAACGA ACGCAAAGAA
CGCCGAAAGT GGGAACAGAT TCCGGTGGAT ACCAGTAAAC GGGTCTTTGT AATTGGTAAC
GGACCCAGCC TGAACATTAC TCCTCTGCAT CTGCTGGATC AGGAACAGAC TATTTGTTTT
AACCGTTTTA CGCTCTTTTT AGATCGGATA CAGTGGAACC CGACCATGTA CATGATCATG
GATGGTTTGG TCGGAAAAGA TATTATTGAG GATATCAAAA CGATGGTCGA TCGTACCCAA
GTCTCTTTCG TTCCGGCTTT CGTGCCAAAA TACCGGGTCA ACTTCAAAAA GCATATTAAG
AGCGAAAAGG TAAGATGGGT CTACCAACGT GGAAAGAAAA TCGAGCTGGC AGATCCGCCC
TACGTGAATG TAAGCAACTC GGTTGCTGTA ACCGCGCTGC GTATTCTGAT CAAGCTTGGT
TTCAAAGAAA TCTATCTGAT CGGAATGGAT ATGAACTATC AGATCCACAA AACCGCCAGC
ACGCTTAAGA ACAACGATAT ACAATCTGTC AAGAACGACG ATCCGAATCA TTTTGATCCC
CGCTACTTTG GTAAAGGAAA GAAGTATCAT CAGCCCAATG AGGAAGTAGT GCAACGTATC
TTTAACTCAC TGACCGAGAT TGGTACACTG GCCGATAAAT ACGGCTCCCA AATCAGGAAT
GCAACTCTTG GAGGTATGCT GGAAGTATTC CCCCGGATAG ATTTAAGAAG CCTGTTCCCG
GAGTTTGAAG CTGAAGAATT TGTCAAACTG CAGGAGTTGA TCAAATTCCG AGCTGGATTC
GAGCTTGAAT CAGCGGATCA CTGGGACAGT ATTCCCCAGG TTGATTCAAT TGATGCGGTC
AGTGAGCATT TGGAAATGTT TCGGGTCGAT ACTGAGCTTA CACATTCATT CCTGAATAAA
TTCATATTTG ATTATAACCT GTTCGGACCA TTCCGGCATC AAAAACTATT CATTAAAAGA
AAGCAAAATG GCTAA
 
Protein sequence
MGKLFELKKY YKHEGLIQSL IHLYWLVSAK ISYKLNERKE RRKWEQIPVD TSKRVFVIGN 
GPSLNITPLH LLDQEQTICF NRFTLFLDRI QWNPTMYMIM DGLVGKDIIE DIKTMVDRTQ
VSFVPAFVPK YRVNFKKHIK SEKVRWVYQR GKKIELADPP YVNVSNSVAV TALRILIKLG
FKEIYLIGMD MNYQIHKTAS TLKNNDIQSV KNDDPNHFDP RYFGKGKKYH QPNEEVVQRI
FNSLTEIGTL ADKYGSQIRN ATLGGMLEVF PRIDLRSLFP EFEAEEFVKL QELIKFRAGF
ELESADHWDS IPQVDSIDAV SEHLEMFRVD TELTHSFLNK FIFDYNLFGP FRHQKLFIKR
KQNG
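
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record: it builds the amino-acid assignments that NCBI translation tables 1 and 11 share, ignores the alternative start codons that table 11 permits (the gene here starts with ATG, so this makes no difference), and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration. Only the first and last blocks of the gene sequence are translated.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed with each codon position cycling through the bases in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the bases in tens.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence above (both are in frame,
# since every full line holds 60 bases).
first_block = "ATGGGTAAAT TATTCGAATT AAAGAAATAC TACAAACATG AAGGACTGAT TCAGTCTCTC"
last_block = "AAGCAAAATG GCTAA"

print(translate(first_block))  # MGKLFELKKYYKHEGLIQSL
print(translate(last_block))   # KQNG
```

The two outputs match the first twenty and the last four residues of the protein sequence listed above.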