Gene Cag_0892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0892 
Symbol 
ID3748082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1222701 
End bp1224104 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content43% 
IMG OID637773423 
Productinternalin-related protein 
Protein accessionYP_379200 
Protein GI78188862 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACTTC GGCTTTTTTT CTCTATTTTT AAAGAACTTA AAAGCAGCGC GGTTCCTCAG 
AGTCCTCTTG TTCCTTTAAG TAGTGATGGA ATTTTTTGGT ACCAAAAAGC GCTACTTCCC
ATTGCAATTA AGAAAATGGA AGTGCGCCAA AGTACAAACC AGCTCGTTAT CTCCCCAACC
TTAAACGCAC TTTTAATTGT GGAGCAACAT ACCTACAACC AATGCCCAGT GTGCGGTTTT
CCGCTTTCCA TGAATAGCGC TATTTGCCCT CGTTGTGGCA ACGATATTCT TGAAGATATT
TCATCACTCG ATCAACAATC GTTGGAGAGG TATCATAAAC ATCTTGAAAA CAAAAAAGCA
GAGTGGTACG CCCGTTGCTT AACCGATCAA ATTACTGGTG GCGACAACCC ACCCTTATCA
GCCGAGCACC AAGAGTGCCC AGCAGGACGC CAAAAGCCTC ACGCTTTGTT TAATAGTGAT
GACGAGTTGG CATTTTTTAC CTCATTAAAC CGTGCCGATA TTCTGCGCGA CACAAATTTG
CGCAAAAAGT GGTGGCAAAG CATTACTGCC GATTGGCAAG ATGTGGTACG TTTCACCTTA
AAAATTAATC ACGATCCTTC CGATAGCGAC TTACTTGCTT TTTTTGATAG CACCAATTTG
CGTTGTGATG ACCGTCGCAT TCATAGCTTG CTGCCTATTC GCGTACTCGA AAAGCTTCAG
CAACTTCGTT GCGATGAATC GCCCATTGAA AGCCTTGAGC CTCTTGCCCA CCTTACCTTG
TTGCAGCGAC TTTATGCCTT TGATTGCGAC TTTACCTCAT TGGAACCGCT GCGTAACCTA
ACGCATCTTA AACTCCTATG GATTTCAAGC ACCGAAATCA CATCGCTTGA ACCCATTAGT
AACCTTATTA ATCTCGAAGA GCTTTATTGC TCCGAAACCG ACATTACCGA TTTAGAGCCA
CTTCGGAAGC TTATCAATCT CGAAAAGCTA AGCTGCTACA AAACCAGCAT TACCTCCTTA
GAGCCACTTG CTGAACTTGA AAATTTAATT GAGCTGGGCA TTAATCACTC CGATATTAAT
GATTTAACCC CTCTTGCAGG GCTTATCAAT CTTGAATACT TGCGCTGTAG TAAAACCGCT
ATTAGCAGCT TAGAACCGTT GCGCAACATG GTAGAGTTGC GGGAACTCAG CATTGCTCAT
ACCAATGTAG ATTCGTTAGA AGGCTTGCAA GGGTTAGAAA ATCTTGAAGA GCTTGATATT
ACGAACACCT TGGTAAGTTC TATTGAACCG CTTATGGGGT TGGAATACAT CGAAAAGCTT
GAGCTTTCGG TTGGCACCAT TCCTGACGAA GAGCTTGAGC GCTTTGTAGA ATTACATCCC
GATTGCAATG TTGTTGCAAA GTAG
 
Protein sequence
MALRLFFSIF KELKSSAVPQ SPLVPLSSDG IFWYQKALLP IAIKKMEVRQ STNQLVISPT 
LNALLIVEQH TYNQCPVCGF PLSMNSAICP RCGNDILEDI SSLDQQSLER YHKHLENKKA
EWYARCLTDQ ITGGDNPPLS AEHQECPAGR QKPHALFNSD DELAFFTSLN RADILRDTNL
RKKWWQSITA DWQDVVRFTL KINHDPSDSD LLAFFDSTNL RCDDRRIHSL LPIRVLEKLQ
QLRCDESPIE SLEPLAHLTL LQRLYAFDCD FTSLEPLRNL THLKLLWISS TEITSLEPIS
NLINLEELYC SETDITDLEP LRKLINLEKL SCYKTSITSL EPLAELENLI ELGINHSDIN
DLTPLAGLIN LEYLRCSKTA ISSLEPLRNM VELRELSIAH TNVDSLEGLQ GLENLEELDI
TNTLVSSIEP LMGLEYIEKL ELSVGTIPDE ELERFVELHP DCNVVAK