Gene Cpha266_0196 details


Gene Information

Locus tag: Cpha266_0196
Symbol:
ID: 4570577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 221628
End bp: 222833
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 49%
IMG OID: 639764796
Product: hypothetical protein
Protein accession: YP_910687
Protein GI: 119356043
COG category:
COG ID:
TIGRFAM ID:
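
The coordinate and length fields above are linked by simple arithmetic. The following is a minimal sketch, not part of any database tooling. The function names are illustrative, and it assumes 1-based inclusive coordinates and a CDS that ends in a single, untranslated stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(221628, 222833)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```

Both results agree with the Gene Length and Protein Length fields of this record.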


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACTA AAAGAATTAT TACGAAGGAA GATGTGCATT TAAAAGCAAG ACTGCTTTCC 
GAAGGAGCAA AGGTAACGGT CAACAAGCCG CCGAAGACAG GTTTCAATCC GTTTCGCGCC
ATGGTGCTCA ATGGAAGTGA TCTTGCAACT CTTGTTCGCC CGGAACCGTA TACCCGCCTT
GAGGTACAGG TTAACGGGGA CGATGTCGAA TTTTATGATT GCGGCAAACA CCTTGCAAGC
GGCAGAATGC AGGAGTGGTT CTCATGGCGC GACGGTACGT TGAGTAACGG ACGCCCGGTT
AATGCTGCCG TAATAGGAAT GAACCAGGAT ATTATCAATA TTCATTACAG TTACAGTTGC
GATAACAACA ATACAGGTCG GTCATGCAGA TTCTGCTTTT TCTTTGCCGA CCAGCATGTA
TCTGTAGGGC AGGACCTGGC AAAGATGCCG TTCCAGAAAA TCGAGGCTCT TGCAAAAGAG
CAGGCAGAAG CGGTTAAAAT AGCAACCGAT AACGGGTGGA GGGGGACTCT TGTTGTTATC
GGAGGTCTTG TTGCGCCGGA ACGTCGTTCG CAGGTTGTCG ACCTTGTTGA GATTGTCATG
GCGCCCTTGC GTGAACAGTT GAGTCCGGAA GTGTTTAATG AACTGCATAT AACGGCTAAT
TTGTATCCGC CGGATGATTT TAAGGATATG GAGCGCTGGA AGGCTTCCGG TATCAATTCA
ACAGAGTTTG ATCTCGAGGT TACCGATCCA GACTATTTCA AGGCGATCTG CCCGGGGAAA
TGTGCAACAT ATCCCCTTGA ATACTGGCAT GCCGCCCAGG ATGCCTCAGT TGAGATTTTC
GGGCCAGGAA GGGGGACCAC AAGCTTTATA CTCATGGGTC TTGAACCTAT GAATTGTATG
CTGGAGGGTG TTGAAGAGCG CCTGTCGAAG GGGGTATATC CCAATATGCT GGTTTACCAG
CCTGTGCCGG GAGCTGATAT GTTCAGAATG CCTCCGCCAA ACGCCGATTG GCTCGTTGAG
GCTTCAGAGA AACTGGCAGA CCTGTATTTC AAGTATCAGG ATCGTTTTGA TATGCCCCTT
GCTACGGACC ACAGGCCCGG TTATACGCGA ATGGGACGTT CCCAGTATAT TATGCTGACA
GCTGATGTAA TTGCACGCAG ATTGTATGAA CAGGGTTATG AACTTCCTGC AGCATATCCG
GTTTAA
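
The gene sequence is printed in space-separated groups of ten bases, so the whitespace has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as displayed (six groups of ten).
first_row = "ATGAGTACTA AAAGAATTAT TACGAAGGAA GATGTGCATT TAAAAGCAAG ACTGCTTTCC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full block, the same two helpers would give the length and GC content to compare with the Gene Length and GC content fields.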
 
Protein sequence
MSTKRIITKE DVHLKARLLS EGAKVTVNKP PKTGFNPFRA MVLNGSDLAT LVRPEPYTRL 
EVQVNGDDVE FYDCGKHLAS GRMQEWFSWR DGTLSNGRPV NAAVIGMNQD IINIHYSYSC
DNNNTGRSCR FCFFFADQHV SVGQDLAKMP FQKIEALAKE QAEAVKIATD NGWRGTLVVI
GGLVAPERRS QVVDLVEIVM APLREQLSPE VFNELHITAN LYPPDDFKDM ERWKASGINS
TEFDLEVTDP DYFKAICPGK CATYPLEYWH AAQDASVEIF GPGRGTTSFI LMGLEPMNCM
LEGVEERLSK GVYPNMLVYQ PVPGADMFRM PPPNADWLVE ASEKLADLYF KYQDRFDMPL
ATDHRPGYTR MGRSQYIMLT ADVIARRLYE QGYELPAAYP V
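
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: `translate` is a hypothetical helper, and the codon table is built from the standard assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons). Here the gene starts with ATG, so no special start handling is needed.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str, to_stop: bool = True) -> str:
    """Translate a (possibly space-grouped) DNA block in reading frame 1."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed.
print(translate("ATGAGTACTA AAAGAATTAT TACGAAGGAA"))  # MSTKRIITKE
```

The output matches the first ten residues of the protein sequence above, and the final six bases, GTTTAA, translate to the closing V followed by a stop codon.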