Gene Cpha266_1443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1443 
Symbol 
ID4570177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1644568 
End bp1645743 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content56% 
IMG OID639766029 
Producthypothetical protein 
Protein accessionYP_911895 
Protein GI119357251 
COG category[S] Function unknown 
COG ID[COG4924] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTGGA CAACCCCTGC CGAACTCAAG GCCCAGGTTC AGAAACTCTG GGATCGGGGC 
CTCATCCTCT CCGCTCTGAC CAACGGCGAA GAACTTTTCC CCCGTCGCCT GACGCTGAAA
GGGCCCGACT CCAGAGAGTT AAGCAACTCT TTTGCCGAAG TACGCGACTG GATTATGCGA
CTTTCCGGTG CTGCCAAACA GTACCGGATT GTATGGCGCA CGGTCAATCA CCGCATTCTG
GGAGCCAATG AACTTCCGGC GGAAATCTGG ATCGATTCAC TCGACGACGC CCTCGGGCTT
ATTGGCAAAC AGCGAGAAGC CCGGCAATTT GCCGCCATGG TCACGCTCAC CCGCGACTGG
CAACCCGCAC TTCTACCCTG GCTCGCAAAA CGCCCCCTGC GAGCCCTCGA ACTGGCCGAA
GAGTGGTCGC ATATTCTCGA AATTGTCGCC TGGCGTCTCA AACATCCCCA CCCGGATATC
TACCTGCGCC AGATCGACCT GTCCGGCGTG CACAGCAAGT TCATCGAAGG GCACCGGGGC
GTACTTGGGG AGCTCTTCGA CCTCCTCCTT CCACCGGAGG AGATTGACGC AACGGTTACA
GGAGCCGGAG GGTTCTCTCT CCGTTACGGC TTTAGGGACA AACCCCTCCG GGTGCGATTC
CGAATCCTCG ACCCGAAACT GGCGCTTCTC CCGACGAATA CCGATCAGGA TATCACCCTG
ACGCAGGCAA CGTTTGCCCG ACTTGAAATC CCCGTTACAA AAATCTTCAT CACCGAAAAC
GAAATCAATT TCCTGGCCTT CCCTGAGGTT CCCGAGGCAA TGGTGATTTT CGGAGCAGGG
TATGGTTTTG AGAACATGGC TTCAGTCGAG TGGATGCGTG ACCGTGTTAT CCACTACTGG
GGAGACATCG ATACCCACGG TATGGCAATC CTCAACCAGT TACGGAGATT CTTTCCGCAG
GCCGCCTCTC TTCTGATGGA CCATGAGACG CTGATGGAGC ACCAACCGCT TTGGGGCGCT
GAACCATCTC CCGAAACCGG TACGCTCACG CGCCTGACCG CTCAAGAGGG TGCGCTTTAT
GATCAGTTAC GACGAAATGA ACTGGGCAGT CGAATTCGGC TGGAGCAGGA GAAGATCGGG
TTTGAGTGGC TGGTTGAGGC GTTAAAAAAG CTCTAA
 
Protein sequence
MNWTTPAELK AQVQKLWDRG LILSALTNGE ELFPRRLTLK GPDSRELSNS FAEVRDWIMR 
LSGAAKQYRI VWRTVNHRIL GANELPAEIW IDSLDDALGL IGKQREARQF AAMVTLTRDW
QPALLPWLAK RPLRALELAE EWSHILEIVA WRLKHPHPDI YLRQIDLSGV HSKFIEGHRG
VLGELFDLLL PPEEIDATVT GAGGFSLRYG FRDKPLRVRF RILDPKLALL PTNTDQDITL
TQATFARLEI PVTKIFITEN EINFLAFPEV PEAMVIFGAG YGFENMASVE WMRDRVIHYW
GDIDTHGMAI LNQLRRFFPQ AASLLMDHET LMEHQPLWGA EPSPETGTLT RLTAQEGALY
DQLRRNELGS RIRLEQEKIG FEWLVEALKK L