Gene Cpha266_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1048 
Symbol 
ID4571010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1187162 
End bp1188424 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content50% 
IMG OID639765651 
Productpentapeptide repeat-containing protein 
Protein accessionYP_911519 
Protein GI119356875 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0897049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTCTA AAACGATACG TTTTTTTTTA CCCATCCTTC TCAATGCGGC TTTGATCCCG 
GTTTTTTCTT GTCGCGTATC AGCATTTAAC CCATCACATC TTGATTCGCT CAATGCCGGA
GTGAAATCGT GGAACAACAT GAGAACCCTG CATAAAGATT TTACCCCTGA TCTTTCAGGT
GCAATACTCA AGGGTCGCAA TCTCAGAGGA GCCGATTTCC AAAACGCCAA TTTTTCAGGT
GCCGTGCTGA CCGATTCCGA TCTCAGCAAT GCGAACTTGC GGAATGCTTC GCTTGACGGG
GCAAGGATGA GTGGAGCGCT ACTGATTCGG GCGGATTTTC AGGGTGCCCG CATGCATGCC
GTCGATCTTG AGGGCGCAGT GCTTGATGGC GCTCATCTTC AAAAAGCAGA GCTTCAACAA
TCGATTCTGC GAAAAGCCGA TTGTTCCAAT GTTGATTTTT CAGATGCGGA TCTTCGCGAC
TGCAATTTTC GGGAGGCGTC GCTGGCCAAC GCAACTCTAA TCGGTGCGGA TTTACAGGCG
GCATATCTCT GGAGGGCCAA TTTCAGCAGG GTAAAGCTCC GTGGCGTCAG GGTATCAGAT
GCAACTATTC TTGATACCGG ACGATATGCC ACCGAGGAGT GGGCCAGAGA TCGTCAGGCA
GTTTTTCTAT CGGCATCCCC ATCTGTTGAT CCATTGAAAG CTTCTTCTGA CGCATCTTCT
GCAGCAGTAG AGGGTGGTTC GAAAAATGCG CGTTCACAAG GGCGCTCTTC TCCAGCTCAT
CTTGTCCGGA ATGCTGCTTC AACGGCAAAT ATCTGGAGAA AAGCCGAGAT TCAATCGGCG
GTTTTGTATG ACAGAAAGCA GTACGAGCAA CTTAAGCGCA ATGTTTTCGA CTGGAATAAA
ACGAGAAAAC AGAACAGCGC CATGCGTGTT ACGCTTCATG GTGCTGATTT TGATCATAAA
AATCTCAGTT ATGCTGATTT AGCAGGGGCC GATCTTGCAG CCTCCACGTT CAAGGGTGCT
GATCTTGAAG AGAGCGATCT GAGAAAGGCT GATCTCAGTG GGTGCGATTT TCGGGAAGCG
AGTTTGCGTG GAGCCGACCT TGGTGGAGCT GATCTGAGGG GCGCCAATTT CTGGCGGGCA
AATCTCGACC GTATTCGTCT TGATGGTGCT GTTGTTTCCG CTGCAACTGT GCTTGATTCA
GGTAAACATG CTACGTCTGA ATGGGCTGTT CGCTTTGGCG TTACATTTGC AGAAGAGAAG
TGA
 
Protein sequence
MRSKTIRFFL PILLNAALIP VFSCRVSAFN PSHLDSLNAG VKSWNNMRTL HKDFTPDLSG 
AILKGRNLRG ADFQNANFSG AVLTDSDLSN ANLRNASLDG ARMSGALLIR ADFQGARMHA
VDLEGAVLDG AHLQKAELQQ SILRKADCSN VDFSDADLRD CNFREASLAN ATLIGADLQA
AYLWRANFSR VKLRGVRVSD ATILDTGRYA TEEWARDRQA VFLSASPSVD PLKASSDASS
AAVEGGSKNA RSQGRSSPAH LVRNAASTAN IWRKAEIQSA VLYDRKQYEQ LKRNVFDWNK
TRKQNSAMRV TLHGADFDHK NLSYADLAGA DLAASTFKGA DLEESDLRKA DLSGCDFREA
SLRGADLGGA DLRGANFWRA NLDRIRLDGA VVSAATVLDS GKHATSEWAV RFGVTFAEEK