Gene Cpha266_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1474 
Symbol 
ID4570244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1672249 
End bp1673382 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content54% 
IMG OID639766060 
Producthypothetical protein 
Protein accessionYP_911925 
Protein GI119357281 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCAAC CCTCAATGTT TAATCATATG ATGACTCGCA GACGGTTTAT TATCTCCTCC 
ATGGCGGCTA TTGGCGGTTT GGCGACCGTA TCGGCATGTT CACCAGAAAA AAATCCCGGC
AGCTACATGG ATGTCGCAAG CGGTATCTGG CGTCACGGAA AGGTGGAGGC CGGGAACAGG
TCTGCGGTTC TGCGTGAGCT TGTGCGATAT GCGACACTGG CGCCATCAAG CCACAATACC
CAGTGCTGGA AGTTTCGCAT TGATGATCGT TCTATCTCGA TTTTCCCCGA TTTCTCGCGC
AGATGCCCCG TAGTCGATCC GGACGATCAT CACCTGTTTG TATCAATCGG GTGTGCGATG
GAGAACCTTA TACAGGCGGC ATCAGCAAAC GGGCTTGATG GTAATGCGGT TTTCGATCCG
TCCTCGCGCG GGAATGTGCG TGTTTCGCTG GAACCAACGA ATGCCGTTGT TACCCCTCTG
TTCAAAGCGA TACCGGAGCG CCAGAGCACA CGAGCCGAGT ATGACAGGAA GCCGATTTCC
ATGAATGAGC TGGCGATGCT GGAAAGGGAG GGTACAGGCA AAGGTGTCCG GATTATTTTT
CTCACTGAAC GCGCGGCAAT GGAGAGCCTG CTCGATTATG TTGTTCAGGG TAATACTGCG
CAAATGAACG ACAGCGCATT CGTTGAAGAG CTGAAAGCAT GGATACGCTT CAGTGAGAGC
GATGCGGTAC GCAGAGGAGA CGGCCTGTAT TCTGCCTCGT CGGGGAATCC ATCCGTGCCT
TCGTGGCTGG GCAGCCTCCT GTTCGGTTTG TTCTTTACAG AGAAGAACGA GAACGACAAG
TATGCGAAGC AGGTGCGCAG TTCAGCAGGT ATCGCGGTGT TTGTATCGGA GGGCGAGAAC
CCTGAGCAAT GGATAGAAGT CGGGAGATGC TACGAACGGT TTGCGCTTCA GTGCACGGCT
TTGGGGATAC GCAATGCCAT GCTCAATCAA CCGGTGGAAG TTGCTGCACT GAGGCCGCAG
TTCGGGGCCT TTCTCGGTAT CGGGGAGCAC AGGCCGGATC TGGTCGTACG ATTCGGACGT
GGTTCGGGAT TGCCGCAGTC ATTGCGACGT CCGGTTGAAG ATGTTCTGGC ATGA
 
Protein sequence
MPQPSMFNHM MTRRRFIISS MAAIGGLATV SACSPEKNPG SYMDVASGIW RHGKVEAGNR 
SAVLRELVRY ATLAPSSHNT QCWKFRIDDR SISIFPDFSR RCPVVDPDDH HLFVSIGCAM
ENLIQAASAN GLDGNAVFDP SSRGNVRVSL EPTNAVVTPL FKAIPERQST RAEYDRKPIS
MNELAMLERE GTGKGVRIIF LTERAAMESL LDYVVQGNTA QMNDSAFVEE LKAWIRFSES
DAVRRGDGLY SASSGNPSVP SWLGSLLFGL FFTEKNENDK YAKQVRSSAG IAVFVSEGEN
PEQWIEVGRC YERFALQCTA LGIRNAMLNQ PVEVAALRPQ FGAFLGIGEH RPDLVVRFGR
GSGLPQSLRR PVEDVLA