Gene Cpha266_0893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0893 
Symbol 
ID4570507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1017939 
End bp1019012 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content42% 
IMG OID639765488 
Productpentapeptide repeat-containing protein 
Protein accessionYP_911365 
Protein GI119356721 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.417756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGCC CAAAACATCT TGAAATCCTG AAACAGGGCG TTGAAGTATG GAATGAGTGG 
CGTGATCAGC ATAAAAATGT TATACCTGAT TTTAGAGGTG CCGATCTAAA ATTCATTAAT
CTTGCTAACG CCAATCTCTC TATAGCAAAT CTCAGAACAG CTAAATTCTC ATATACAAAT
CTTACAAGAG CAAATTTATC AGGTTCTAAT CTTGCTGATG CCAATCTTAC AGGTGCCAAT
CTAACGGGAG CAAATCTATC GAGATGCAAT CTTTCTATAG CCAATCTTTC AACGGCCAAT
CTTTCAAAAG CTAATCTTGA AGGAGCTATT CTTATAGATG CTGATCTCAC AAGGGCTAAT
TTTAGAGAAT CTAATCTCAT ATTTGCAAGT TTATCAGGAA GTGCCCTCAT AAAAACTGAT
TTCAGTAACG CAAGTGTCGG ATGGACAATC TTTACTTATC TTGATCTTAG CCCCTTATAC
AATTGTGCTA TCGGTCTTGA GACAATAATT CACAAAGGAC CTTCTTCTGT AGGAATCGAT
ACGATATACC AATCAAATGG AAATATCCCT GAAGTCTTTC TTCGCGGGTG TGGTCTTTCC
GATGAATTTA TTGCCTATAT CCCATCTTTA ACCGGAAAAG GTATCGAATA CTACTCGTGC
TTCATCAGCT ACAGCCATAA AGATGAAGTT TTTACTAAAC GGCTGCATAA CGACTTGCAG
GCAAACGGTG TGCGCTGCTG GTTTGCTCCG CATGACATGA AAATCGGAGA TAAAATCCGC
CCAACCATTG ACGACTCAAT CCGAGTTCAT GACAAGCTGC TTCTGATCCT TTCAGAACAC
TCAGTGCAGA GTGATTGGGT TGAGCACGAA GTTGAGCATG CTTTTGATCT TGAGAAAGAA
CGAAAACAAA CAGGACTTTT CCCGCTTCGA ATCGATGAAT CAATCATGGA GAGTACAACC
GGATGGGCAG GAAATGTGAA GCGCCAGAGG CACATCGGAG ACTTCACAAA ATGGAAACAG
CGCGACGCCT ACCAGGCCGC ATTCGACCGC CTCTTGCGTG ATTTGAAAGC CTGA
 
Protein sequence
MASPKHLEIL KQGVEVWNEW RDQHKNVIPD FRGADLKFIN LANANLSIAN LRTAKFSYTN 
LTRANLSGSN LADANLTGAN LTGANLSRCN LSIANLSTAN LSKANLEGAI LIDADLTRAN
FRESNLIFAS LSGSALIKTD FSNASVGWTI FTYLDLSPLY NCAIGLETII HKGPSSVGID
TIYQSNGNIP EVFLRGCGLS DEFIAYIPSL TGKGIEYYSC FISYSHKDEV FTKRLHNDLQ
ANGVRCWFAP HDMKIGDKIR PTIDDSIRVH DKLLLILSEH SVQSDWVEHE VEHAFDLEKE
RKQTGLFPLR IDESIMESTT GWAGNVKRQR HIGDFTKWKQ RDAYQAAFDR LLRDLKA