Gene Cpha266_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2066 
Symbol 
ID4569499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2399324 
End bp2401234 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content54% 
IMG OID639766647 
Producthypothetical protein 
Protein accessionYP_912502 
Protein GI119357858 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGTA TCATTGATGA AATCCGGGCC GAAGTGTTGA AGCCGGCAGA AAAGTTTTCT 
TCTGGCGGGG TTTTTGAACA AACCGACGTT CACTATGTTC ACCACGTTCA CCAATCCGAG
ACGGACGAGC AGGAACTTGA GCGTTGCGTT ACCGTCGTTG AGGATGTTGT CCGGCGCGGC
AAGGATGAGC CGGGGTTGTA TGGGAGTGAT GCGTTCAGGG CGGCATGGAC GCATATCTGT
AAGAGTTCCG ACGAGCAGCG TTTCCGGTTG AGGGTTAAGA TCAAGGAGAA CAAGCCTATC
GGCGTTCCGT TGGGTGAAAT CGACAGGGTT GGCTCTGGAG GCGAAGGCGG AGAGGATCGA
AGCGCGGCGG ACGAGTTGGT ATCATTGGCC GTTGGAGCCG GGGAGTTGTT TTTCGACGAG
ACGACGAGGG ACGGCTTTGT TACCGTCGGA AGCGATACGA TGAGGGTACG GAGCGTCGCG
TTTCTTGATT GGTTGAGTTA CCGCCATTAC CGCGAATCAG GTGGCTCTTC TGCAAGTGAT
TCTGCATTGA AACAGGCTTG TGGGACGCTA TCGGGGTTAT GCCAGCATGA AGGGAAGCCG
GAGCGGGTTT TTCTGCGGGC GGGGCGGAAC AGCGAGACCG GCGCGTATTA CCTGCACGGG
ATCGGGAAGA ATGGTCAATC GGCAGAAGTG ACGGCAACAG GGTGGCGTTT GCTCGAAGTC
GCGCCAGTGA AGTTCTGGCG CTCAGGTTCG GCAGTGCCGT TCCCTGAGCC TATCGCAGGC
GGAGATATTG GCAAGCTTTG GGATCTGGTC AACATTCCAG AGGCTGACAG AGTGCTTGTT
CTTGCGTGGA TTCTTGAGGC ATGGCGACCA GATACTCCAT TCCCGGCTCT TGAGTTGTGC
GGGGTTCAGG GATCGGCCAA ATCAAGTACA CAGAAGCGGC TTCGGATGTT GGTCGATGCA
AGTTCTGCAC CATTGAGAGC AGCACCAAAA ACGGTTGAAG ATGCCTTTGT TTCTGCCGGG
AATAACTGGG TTGTGAGTCT GGATAACGTT TCCCATTTGA GTGCGGGCCT GCAAGATGCC
TTCTGTATTA TGGCTACAGA TGGCGCGTAT GCTGGGCGGA CGCTGTTCAC CAATGCGGAT
GAGACTGTAA TAACGATCAA AAGGCCGGTA GTGCTGTCAG GGATCAATAA CCCTGTGACA
GCGCAAGACC TCATTTCAAG GACTGTTCAT ATCGAGCTGC CAGTTATCGA GGGTCGGCGG
CGGCTTGAGT CTGAGCTTGA TGCAGCGTTC AACGAGGCAT GGCCGGAGAT ATTTGGCGGC
TTGCTTGACC TGTTCGTAAA AACTCTCAGG ATGTTGCCTG AAACGGTACT CCCTGTGGAT
GCGCTCAGGA TGGCCGATTT CAACCATCTT GGCGAGGCGA TGGCGCGAGC TATGGGGTAT
AAGCCAGGCG CGTTCACTTC GCTTTATCGG GGCAATTACC GCGAGTCCGT TCAACGAGCG
ATGGAGTCAA GTCCGGTGGC GGTTGCGGTC GCGCAAATGG CTGAAGATTC ACCTTTGCAA
GAGGTGTTCG ATGGGACGTA TAAGGAGCTG CTCGAAAAAC TCGTAACGCA CAAGACGGAT
AATGAGGGAT GGCCGAAGAG TGAGAAGGGG TTAGCCAATG CGCTCAAACG GCAGATGCCG
GCATTACAGG AGATTGGTGT TGTCCTGATT CCTGAGACCG GGCCGAAGCA GAGCAAGAAA
GGCCGAAGGA TCGTTATTCG AAAGGGTGAA CATGGTGAAC ATAGTGAACA TGGAAACGTC
CAAAAAGCCC CGGAAGAAAA AAAATATCCG ATTGGATGCG GGTCGGACGG GTCAGACGGG
TCAAATGACC CATGCGAGCC AACCGGCACA AGCGATGACA TGCCGTTCTG A
 
Protein sequence
MTSIIDEIRA EVLKPAEKFS SGGVFEQTDV HYVHHVHQSE TDEQELERCV TVVEDVVRRG 
KDEPGLYGSD AFRAAWTHIC KSSDEQRFRL RVKIKENKPI GVPLGEIDRV GSGGEGGEDR
SAADELVSLA VGAGELFFDE TTRDGFVTVG SDTMRVRSVA FLDWLSYRHY RESGGSSASD
SALKQACGTL SGLCQHEGKP ERVFLRAGRN SETGAYYLHG IGKNGQSAEV TATGWRLLEV
APVKFWRSGS AVPFPEPIAG GDIGKLWDLV NIPEADRVLV LAWILEAWRP DTPFPALELC
GVQGSAKSST QKRLRMLVDA SSAPLRAAPK TVEDAFVSAG NNWVVSLDNV SHLSAGLQDA
FCIMATDGAY AGRTLFTNAD ETVITIKRPV VLSGINNPVT AQDLISRTVH IELPVIEGRR
RLESELDAAF NEAWPEIFGG LLDLFVKTLR MLPETVLPVD ALRMADFNHL GEAMARAMGY
KPGAFTSLYR GNYRESVQRA MESSPVAVAV AQMAEDSPLQ EVFDGTYKEL LEKLVTHKTD
NEGWPKSEKG LANALKRQMP ALQEIGVVLI PETGPKQSKK GRRIVIRKGE HGEHSEHGNV
QKAPEEKKYP IGCGSDGSDG SNDPCEPTGT SDDMPF