Gene Cpha266_1326 details


Gene Information

Locus tag: Cpha266_1326
Symbol:
ID: 4570904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1518585
End bp: 1520078
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 50%
IMG OID: 639765915
Product: pentapeptide repeat-containing protein
Protein accession: YP_911781
Protein GI: 119357137
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID: [TIGR02145] Fibrobacter succinogenes major paralogous domain
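
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; the helper names are illustrative.
START_BP = 1518585
END_BP = 1520078
GENE_LENGTH_BP = 1494
PROTEIN_LENGTH_AA = 497


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1494
    print(expected_protein_length(GENE_LENGTH_BP))  # 497
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.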


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTCACC GGAGTTTCAA ACAACTCTTG CAGCAATTAC GCGACCGAAA CCGGGCGCCC 
CGCAAATGGT GGTATCCTGT GCCAAAAACA GATCCCCGTC ACCGGGATGT CATCGAGCAG
TTGCATGAGA ACCACTCGAA AACGATCAAC AAAACCATGT TTTCGCTGCT CGGCGTCGGG
CTCTACTGCC TCCTCAAGGT TCTTGGCGAA TCCGACAAAT CGCTTATCGT CGCAATCACA
ACCATTCAGA CGCCGCTGGT TGGCACCTCC ATCTCGTTTC AGAGCTTTCT TCTCATAGCG
CCCGTTCTGC TTGTCATCCT TGTAACATAC CTGCACATTC TCTACGGCTA CTGGCTGCAA
CTCGAACGGA AACGGAAGGA GATAAACGAA GCGGCAGAGA GAGAGGGTGC GCCTACCATC
GAGAGCATTC CCTCTCTTTT CAGCTTTCCC GATCGCCTGC CGCGCCTCTT CACCAACCTG
ATATTTTACT GGTTCGTACC ACTCACGCTG TGGATTATGG CCAATAAAAC TTTTGCCCTC
AGTGAATTAC GTTATCCTCT TTTTATGATC GCATCCGTTG TTACCCTTGC GATGCTGTTT
TTACAGATCT ACCGTTGCCA ATCGAAGCGG TGGCTCCGGA ATTCGCCCCG ATGGGTAGCC
TCAGGAGTAC TCTTGCTTTA TATGGTATAC GTTTCCTCCA ATCCCGAATC CCTCCGCAGA
CCCCTGAATC TTCAGCGGGA GGATCTTCAT GGATCCTGGC TGCAAGGTCT CGATATGGCT
GGTGCCGATA TGAATAACGC CAATCTTCAG GGAGCAAACC TTTCAGGGGC TGATTTGCGG
AATGTCAATC TTCAGAATGC CAACCTTCAG GAAGCCGATC TCAGAAACTC GAAATTGCAG
GGAGCTGATC TGAGATACGC AAAGTTTCAG AAATCCATTA TAGGGAATGC TGATTTCGAA
GGGGCGGAGC TTGACCATGC AGACTTCAGG GATGCAAAAG AGAATGATCC CAATCGGTTC
AAGGCTGCCA ATAACTATAA ATGCGCATTT TTCAGCGAGG GTTTACTTTC AGAACTATCT
CTTTCGCCAA CCCACAATCA AGACCTTGAA AGAATTGGTA CTTACTATCG CGGGGGAAAA
ATTGCCTATA TTTTTCAGCC GGGTGATCAG GGTTATATCG AGGGAGAACA GCATGGGGTG
ATTGCTGCTA TAACGGATCT TCCGGGAGAA GACAAATACA CCTGGGATGC GGCAATAAAA
GCCTGTGACG AGTTGGCAGA AAACGGTTAC AACGACTGGC GATTGCCGAG CCAGGACGAG
TTGAACCAGC TCTATCTCAA CCGGAGTGCT GTTGGCGGTT TTGCTCCCGG CTTCTACTGG
AGTTCTACGG AGAACGCTGC GTTCAACGCA TGGCTACAGA ACTTCGACGA TGGGTTCCAG
CTCGACTTCA TCAAGAACCT CGAGTGGCGG GTGCGGCCTG TCCGGGCTTT TTAA
 
Protein sequence
MPHRSFKQLL QQLRDRNRAP RKWWYPVPKT DPRHRDVIEQ LHENHSKTIN KTMFSLLGVG 
LYCLLKVLGE SDKSLIVAIT TIQTPLVGTS ISFQSFLLIA PVLLVILVTY LHILYGYWLQ
LERKRKEINE AAEREGAPTI ESIPSLFSFP DRLPRLFTNL IFYWFVPLTL WIMANKTFAL
SELRYPLFMI ASVVTLAMLF LQIYRCQSKR WLRNSPRWVA SGVLLLYMVY VSSNPESLRR
PLNLQREDLH GSWLQGLDMA GADMNNANLQ GANLSGADLR NVNLQNANLQ EADLRNSKLQ
GADLRYAKFQ KSIIGNADFE GAELDHADFR DAKENDPNRF KAANNYKCAF FSEGLLSELS
LSPTHNQDLE RIGTYYRGGK IAYIFQPGDQ GYIEGEQHGV IAAITDLPGE DKYTWDAAIK
ACDELAENGY NDWRLPSQDE LNQLYLNRSA VGGFAPGFYW SSTENAAFNA WLQNFDDGFQ
LDFIKNLEWR VRPVRAF
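
The relationship between the gene sequence and the protein sequence above can be checked by translating codons directly. The following is a minimal sketch, not the database's own pipeline. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation (table 11 differs only in the set of permitted start codons). The `translate` helper is a hypothetical name, and only the first and last stretches of the gene are included to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 54 bases of the gene sequence listed above.
GENE_HEAD = "ATGCCTCACC GGAGTTTCAA ACAACTCTTG"
GENE_TAIL = "CTCGACTTCA TCAAGAACCT CGAGTGGCGG GTGCGGCCTG TCCGGGCTTT TTAA"

if __name__ == "__main__":
    print(translate(GENE_HEAD))  # MPHRSFKQLL
    print(translate(GENE_TAIL))  # LDFIKNLEWRVRPVRAF
```

Both fragments reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to the same function would be expected to yield all 497 residues.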