Gene Clim_0810 details


Gene Information

Locus tag: Clim_0810
Symbol:
ID: 6353880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 887304
End bp: 888506
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 50%
IMG OID: 642668434
Product: internalin-related protein
Protein accession: YP_001942869
Protein GI: 189346340
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
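
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch like the one below. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the function name check_lengths is hypothetical and not part of the record.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check coordinate span, gene length and protein length.

    Assumes 1-based inclusive coordinates and a CDS that ends in one stop codon.
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_len_bp // 3         # codon count, including the stop codon
    return {
        "span_matches": span == gene_len_bp,
        "multiple_of_three": gene_len_bp % 3 == 0,
        "protein_matches": codons - 1 == protein_len_aa,
    }


# Values copied from the Gene Information fields for Clim_0810.
result = check_lengths(887304, 888506, 1203, 400)
print(result)
```

With the values listed for this gene, all three checks come out true: 888506 - 887304 + 1 = 1203 bp, and 1203 / 3 = 401 codons, one of which is the stop codon.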


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.448764
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAGCAGA ACACCTACAG CAATTGCCCG ATATGCGGTT TCCCGCTCTC CTCGGAGAGT 
GCGGTCTGCC CCCGGTGCGG TAACGATATT CTCGAAGACA TCAACTCTCT TGACGAGCAG
AGCATGGACC TTCATCGTCA CAATATCGAA GAAAAAAAGG CTGCCTGGTA CACGCGCTGC
ATAACGGAAA ATCTCGGGTT CTGTGAAAAT CCGGTTGAAG AGTCATGCCC CGATACAGCG
AAAATATCCG GAACACGACA CCTCTACTGC AGTTCGGAAG AACGTGAATT TCTGGGTACC
TGTAACAGGT CCTCCCTTGT CGATGACAGC TCGCTCCGCA GGAAATGGTG GAACTGTCTT
ACAGCCGACT GGAAAGAGGT GGTCAAAAGC ACCATAAAAC TGGTACGCGA TCCCTCCGAG
AGCGAACTTC TCGATTTTTT TCAAACCACT CATCTGCGCT GCGATAATCG CCGCGTGCAC
GATCTTCTCC CGGTACGCAT GCTCGAGCAT CTCCAGCAGC TGCGCTGTGA TGAATCGCCG
GTGGAGAATC TCGAACCCAT TGCGAATCTC ATCCATCTGC AGCGTCTCTA TGCGTTCGAC
TGTGATATCG CGTCTCTCGA ACCTCTGCGC AATCTCCGGA ATCTGAAACT GCTCTGGATA
TCGAGTACTC AGATAACATC GCTGGAGCCA TTGAAAAATC TGGTCAATCT TGAAGAACTG
TACTGTTCGG AAACCATGAT TACCGACCTC TCACCCCTGC AATCGATGCT CTCGCTTGAG
AAGCTCAGCT GCTATAAAAC GGAAATCACC AATCTCGATC CCTTGAGATC TCTTGAAGAT
CTCATCGAAC TCGGCATCAA CAACACGGGT ATTGACGATC TGGCTCCACT TGCCGGTCTG
CGTAATCTCG AGTACCTTCG CTGCAGCAAA ACCAACATAG CAAGCCTTGA TCCTCTAAAA
AATATCATCG GGCTGAGAGA ACTCAATGTC TCAAAAACAA AGATATCCTC GGTCGAACCG
CTTGCAGGTC TCGTTGATCT CGAGGAACTC GATATTTCGC ATACTCTTGT ACGCTCAATA
GAGCCGCTCA TGCATCTGGA AAGTTTCGAA AAGCTCGAGC TTTTGGCAGG CCAGATTCCC
GATATGGAGA TCGAACGGTT CATTGAACTG CATCCCGGCT GTGAAGTCCT GCTGAAAAAC
TGA
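
The GC content reported in the Gene Information fields can be recomputed from the gene sequence. The sketch below is one simple way to do so, assuming GC content is defined as the fraction of G and C bases over the full sequence length; the function name gc_content is hypothetical. Whitespace is removed first because the sequence is printed in blocks of 10 bases.

```python
def gc_content(seq_text):
    """Return the fraction of G and C bases in a whitespace-formatted sequence."""
    seq = "".join(seq_text.split()).upper()   # drop block spacing and newlines
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10-base block of the gene sequence, as a small worked example.
print(gc_content("TTGGAGCAGA"))  # 5 of 10 bases are G or C -> 0.5
```

The full Gene sequence block can be passed to the function unchanged, since the block spacing and line breaks are stripped before counting.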
 
Protein sequence
MEQNTYSNCP ICGFPLSSES AVCPRCGNDI LEDINSLDEQ SMDLHRHNIE EKKAAWYTRC 
ITENLGFCEN PVEESCPDTA KISGTRHLYC SSEEREFLGT CNRSSLVDDS SLRRKWWNCL
TADWKEVVKS TIKLVRDPSE SELLDFFQTT HLRCDNRRVH DLLPVRMLEH LQQLRCDESP
VENLEPIANL IHLQRLYAFD CDIASLEPLR NLRNLKLLWI SSTQITSLEP LKNLVNLEEL
YCSETMITDL SPLQSMLSLE KLSCYKTEIT NLDPLRSLED LIELGINNTG IDDLAPLAGL
RNLEYLRCSK TNIASLDPLK NIIGLRELNV SKTKISSVEP LAGLVDLEEL DISHTLVRSI
EPLMHLESFE KLELLAGQIP DMEIERFIEL HPGCEVLLKN
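
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted alternative start codon and is read as methionine when it initiates translation. The sketch below illustrates this under stated assumptions: the codon-to-amino-acid assignments are those of the NCBI genetic code tables, the start codon set is the one NCBI lists for table 11, and the function name translate_cds is hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq_text):
    """Translate a CDS, reading a table-11 start codon in first position as M."""
    seq = "".join(seq_text.split()).upper()   # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"                  # initiator codon is read as Met
        else:
            amino_acid = CODON_TABLE[codon]   # KeyError on ambiguous bases such as N
        if amino_acid == "*":
            break                             # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("TTGGAGCAGA ACACCTACAG CAATTGCCCG"))  # MEQNTYSNCP
```

The printed peptide matches the first ten residues of the protein sequence above. Without the start codon rule, the leading TTG would be read as leucine (L) instead of M.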