Gene Cpha266_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1053 
Symbol 
ID4571015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1195433 
End bp1196746 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content53% 
IMG OID639765656 
Producthypothetical protein 
Protein accessionYP_911524 
Protein GI119356880 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGACG GGATGATGGT TTTGCTTCTT GTTCTGGTGT TCGGGCTTCT TTTTTTTATC 
GTTTTTCGCA TTCTGCGCGA TGCTCCCCTC AAAGAAGAAC TGCACCAGCT CAGGGTTGTC
GAGCGGGAGC TTCGCTCTCA GGTGGATGAG CTAAAGGCAA AAACCGGGGA GCTTGATATA
TTGAAAGTTA TCCGTGCCCG TCTCGAATCA GATCTTGACC ATGAGCGCAG CAATGCATTG
GAGAAAATTG CGCTTCTGCA GCAATCGGAA TTACGACTGA AAACAGAGTT CGAGCATCTT
GCCGGGCGTA TTCTTGAAGA GCGTGGAAGC TCGCTTGGAG AGGAGAACCG GGTTAGAATG
GCTTCACTTC TGCAGCCGCT TAAAGAGCAG CTCGATGCAT TCCGCACGCG AGTCGATGAG
GTACATCGAA ACGATACCGA GATTTCCGCC CGACTTATCG AGCAGGTACG ACAGCTCCAG
GAGCTCAGCG GGCAGGTGAG CAGAGAGGCT AATTTACTTG CCCGGGCTAT CAAGGGCGAG
AGTAAAGCAC AGGGCGACTG GGGAGAACTG ATCATTGAAA GGATCTTTGA GGCTTCGGGG
CTTGAAAAAG GGCGGGAGTA CACCGTACAG GAGAGTTTCA GGATGGAGGA TGGTACTCTG
AAACGGCCTG ATTTTATGGT TCTCCTTCCG GGTGAAAAGG CCGTTATAGT CGATTCAAAA
GTCTCTCTGA CGGCCTATGA ACGCTATTGC AGCCTTGATG ATGTTGCCAG GCGGGAGCAG
GCTCTTCGGG AGCATGTTCA ATCGGTGCGC CGTCACATAG CCGGGTTGCA GGAAAAGGAG
TACAGCTTTA TCAAGGGGAA TCGTACGCTT GATTTCGTCA TCATGTGCAT TCCCGTGGAA
CCGGCATGGC AGGCTCTCAT GCAGGCAGAC CCGGAGATCG TATACGAACT TGGCAGAAAA
AACGTGGTGC TGACCGGCCC GACCACGCTG ATGATCACCC TGAAGCTTAT TGCGCAGCTC
TGGCGGCGCG AGAAAGAGAA TCGTAATGCC GAGGTTATTG CCGAAAAGGC CGGTCGGATC
TACGATCAGG TTGTTCTGAT AGTCGAAGCC ATGGAGGATG CACGAAAAAA ACTTTCGGGC
GTCTCCCAGT CATTTGATCT TGCCATGAAA CGACTCACGG AAGGACGGGG GAGTCTGGCG
TCGAAGGTTG AGGAAATCCG TCGGCTTGGG GCAAAGGTCA GCAAACAGCT TCCCGGGGGT
TTTGACGATA ACGAAGAGAG CGAGAGCGTC AACGGGAATA GCTCGGCCTT CTGA
 
Protein sequence
MSDGMMVLLL VLVFGLLFFI VFRILRDAPL KEELHQLRVV ERELRSQVDE LKAKTGELDI 
LKVIRARLES DLDHERSNAL EKIALLQQSE LRLKTEFEHL AGRILEERGS SLGEENRVRM
ASLLQPLKEQ LDAFRTRVDE VHRNDTEISA RLIEQVRQLQ ELSGQVSREA NLLARAIKGE
SKAQGDWGEL IIERIFEASG LEKGREYTVQ ESFRMEDGTL KRPDFMVLLP GEKAVIVDSK
VSLTAYERYC SLDDVARREQ ALREHVQSVR RHIAGLQEKE YSFIKGNRTL DFVIMCIPVE
PAWQALMQAD PEIVYELGRK NVVLTGPTTL MITLKLIAQL WRREKENRNA EVIAEKAGRI
YDQVVLIVEA MEDARKKLSG VSQSFDLAMK RLTEGRGSLA SKVEEIRRLG AKVSKQLPGG
FDDNEESESV NGNSSAF