Gene Cpha266_1193 details

Gene Information

Locus tag: Cpha266_1193
Symbol:
ID: 4570061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1349493
End bp: 1350695
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 44%
IMG OID: 639765787
Product: internalin-related protein
Protein accession: YP_911653
Protein GI: 119357009
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
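
The length fields above are consistent with the coordinates if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1349493
end_bp = 1350695


def gene_length(start, end):
    """Number of bases covered by an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the script reproduces the listed Gene Length and Protein Length.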


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0638484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAGCAGA ACACCTATAA CAAATGCCCG GTATGCAGTT TTCCGCTCTC CCTTGAAAGC 
GCTGTATGCC CGAGGTGCGG TAATGACATT CTTGAAGATA TCTCTTCTCT TGATCAGCAA
AGCGAGGAGC TTCATCGAAA AACAATGGAT GAAAAAAAAG CCGAATGGTA CACCTGGTGT
ATAACTGAAA ATCTCAATAT TTCCGATAAC GAACTATCGA CAAAACCTCC TGATAGAGAG
AAGACATCAG AGTCCAGACA TCTCTTCTCT ACTCCTGATG AACAGGAATT GCTCCGTACC
GCATCAAAAG CGATCCTTCT GAAGGATCAT TCGTTACGAA AAAAATGGTG GCAAGCGCTC
AGTGCCGACT GGAAAGAGGT TATTAAAAAC ACCATAAAAA TAGTTCGCGA ACCCAATGAT
CAGGAAATCC TTGATTTTTT TCAAACCACT CATTTCCGAT GTGACAACAG AAGGATTCAC
GACCTCTGGC CAATACGAAT ACTGGAAAAT CTTGTACAAC TTCGTTGCGA TGAATCGCCG
GTTGAAAGTC TTGAACCCCT CGCGCATCTC AGCTCGCTGC AACGAATCTA TGCCTTTGAT
TGTGATTTCT CCTCGCTTGA GCCCCTTCGC AAACTGAAAC ATCTGAAATT GCTCTGGATA
TCGAGCACCC AGATAAGTAA CCTTGACCCT CTCCGTGAAC TTACCAGTCT TGAGGAGCTG
TACTGTTCGG AAACCCTTGT AAAAGATCTG GATGCACTGG CGGAACTGGT CAATCTTGAA
AAGCTCAGTT GTTATAAAAC CGAAATCGCA TCTATTGAGC CTCTGGCAAA TCTTTCGAAC
CTTATTGAGT TGGGCATCAA CAACTCGAAT GTCACCGATA TCCGACCGCT TTCAAAACTT
ACCGGTATTG AATACCTGCG GTGCAATAAA ACAGGAATCA TGAATCTTGA ACCACTGGCA
AACCTTGCAG GACTCAGAGA ACTGAGTATT TCAAGAACAC GGGTGGAGAG CCTTGAACCT
CTTGCGGAAC TTATGGAGCT TGAAGAACTT GATTTTTCAA ATACGGAAGT ACAATCCATT
CTCCCCCTCA TGCAACTCGA AAAACTTGAA AAAATCGAGC TCTCTGCAGG AACGGTTCCT
GAAAAAGAAC TGGAAAGATT TATTGAATTG CATCCTGATT GCGAAATTCT TCTGACGCAA
TAA
 
Protein sequence
MEQNTYNKCP VCSFPLSLES AVCPRCGNDI LEDISSLDQQ SEELHRKTMD EKKAEWYTWC 
ITENLNISDN ELSTKPPDRE KTSESRHLFS TPDEQELLRT ASKAILLKDH SLRKKWWQAL
SADWKEVIKN TIKIVREPND QEILDFFQTT HFRCDNRRIH DLWPIRILEN LVQLRCDESP
VESLEPLAHL SSLQRIYAFD CDFSSLEPLR KLKHLKLLWI SSTQISNLDP LRELTSLEEL
YCSETLVKDL DALAELVNLE KLSCYKTEIA SIEPLANLSN LIELGINNSN VTDIRPLSKL
TGIEYLRCNK TGIMNLEPLA NLAGLRELSI SRTRVESLEP LAELMELEEL DFSNTEVQSI
LPLMQLEKLE KIELSAGTVP EKELERFIEL HPDCEILLTQ
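
The gene sequence begins with TTG while the protein begins with M. This fits translation table 11, where TTG is one of the alternative initiation codons and an initiator codon is read as methionine. The Python sketch below illustrates that relationship and a simple GC percentage. It assumes the gene sequence is shown in coding orientation. The helper names (`translate`, `gc_percent`) are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; it differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("TTGGAGCAGA ACACCTATAA CAAATGCCCG"))
```

For those first ten codons the sketch prints MEQNTYNKCP, matching the start of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.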