Gene Cphamn1_2043 details

Gene Information

Locus tag: Cphamn1_2043
Symbol:
ID: 6375736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2204618
End bp: 2206723
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 54%
IMG OID: 642684534
Product: thiol-disulfide interchange protein-like protein
Protein accession: YP_001960434
Protein GI: 189500964
COG category: [C] Energy production and conversion
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4232] Thiol:disulfide interchange protein
        [COG4233] Uncharacterized protein predicted to be involved in C-type cytochrome biogenesis
TIGRFAM ID:
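
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(2204618, 2206723)
print(length_bp)                  # 2106
print(protein_length(length_bp))  # 701
```

Both results agree with the Gene Length (2106 bp) and Protein Length (701 aa) fields.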


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.476424
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.478834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTGTTAT CATGTATCGC ACTCTGTCTT CTTGCATGCA GCTCACCGCT TCATGCGGGA 
TTGTTCGGTG ACGATCAGCC TGTTGAAGCT GAAGCGTTTC TTGTCGGGAA TTCTTCCGGG
GAAGGGCTTC TTGCGGCTGT TCATCTGAAG ATAGAAGAGG GCTGGCACGT CTACTGGAAG
AACCCCGGAG AGGCAGGGAT GCCTGTGGAG ATAGTATGGG AGCTTCCCGA GGGGTTTCGC
TCACTTCCTG TCGAGTATCC TTTTCCGGAG CGGTTCGAGT CAGGGGGGAT TACAGGGTTT
GGCTACAGGG ATGAGGTTGT CCTTTTTTCC AGAATTGTTC CTGAAAAACC ATCTGATAAC
CGTATTTTTC CCCGAAGCGG CTTTCCCCTG AAGGCTGAAG TGTCATGGTT GTCATGCAAG
GAGGTCTGTA TTCCGGGTTC GGTTTCTCTG GATCTGGAAG CGGATTCCTC AAGTCAGGCG
AACAGGGATC TCGCCGCGAA GTTTCTCGGG AGGATTCCCC TGAAGGGGGA TGCGTCACTT
GTCGTAGAGC GTATTGCCGC AGTGCGAGAG GACGGCTCCG GTTTTCTTGA AATTGATTTT
TCCGGTCCGG ATGCCCGGAA CGTGAAAGAT TTCTTTCCCC TGTATCCGGA AAAAGGTATT
GACATCAAAG GCACTACGGT TCAGGGCGGG TCGGTACAAC TGCCCTTGAG CGCTGAAGAG
GCTCCCGAAC GCCTCAAGGG TGTTGTCGTC ACAGAGCGCA GCGCGTATGT TGTCGATGCC
GGGGTTGATT CCGGGGAGGC GGTTCCGGCC TCATCGGCTG TTTCCGGTTC GCTTCCCGTA
ATGCTCGGAC TTGCGTTTTT GGGGGGGATT CTCCTGAACG TCATGCCGTG TGTGCTGCCC
GTCATCGGGT TGAAGGTGTT CAGTCTTGTC GGCAGTGACC CGTTGGCTTC AGGCGGGCAT
TCGCGAAAAA CCGGCCGCGT TCACAGTCTT GTCTTTGCTG CGGGAGTGCT GTTGTCTTTC
TGGGTACTTG CCGCCTTTGT CTGGGGGCTG CAGGGTATGG GGCAGCAGAT AGGATGGGGA
TTTCAGTTTC AGTCTCCCGT ATTCGTCATG TTCATCGCCG CGATTGTTTT TGCTTTCAGT
CTGAACCTTT TCGGTCTCTT TGAACTCGGC GCTCCGGTTG TGTCCGGTAA AATCGGCAGC
GTCGCCTTGC ATCACGATGT TCTCGGGTCG TTTGTCAGCG GCGTTCTGGC AACCACCCTT
GCGACACCAT GTACGGCTCC TTTTCTCGGA ACCGCTCTCG GATTTGCCTT CGTTCAGCCC
GTATGGGTGG TCTTTCTTTT TTTTACGGTT ATCGCCGTGG GTATGGCCGC TCCCTATGTG
ATACTTGCAT GGCATCCTGC ATGGCTGAAG CTTCTTCCGA AACCGGGACA CTGGATGTTT
GTCTTTAAAC AGTTGATGGG TTTTATCCTT GTTGCCGTGG TGGTATGGCT TGCCGCTATA
CTCAACGCGC AGGCCGGAAG CGACGGCATG TTCAGTCTTC TTCTTCTGTT GTTTGTCATT
GCCTTTTGCC TGTGGATTGT CGGAAGTCTG ACGGCTCACG GAGCTTCAAT GCGGAAACAG
CTTGTCGTCT GGGCAGCGGC TCTTGTTTTT ATGACAGGAG CATTTCTGAT ACTCATATCC
GATATCGGGA GCGGAGCAAA ACAGGGGATT GAGGCTGACC GTTCGGGATT GAGCGCAGCC
GGTGACAACA ACGGTGCGTT GTGGCAGGAG TTTTCCCCTT CCCTCTTCGA GGAACTGCTT
GAAAAGAAAA AAACGGTCTT TCTGGAATTT ACGGCCGACT GGTGTATAAC CTGCAAGGTG
CTTGAGGCAA GCGTCCTGGG TAATAACGAT GTTGTCGAAG CGCTGCATCG ACCGGATGTC
GCGGCAGTCA GGGCTGACTG GACGTCAAGG GACGATGCGG TCACTGCGCT GATGCAGCGG
TTCGGCAGAT CCGGAGTCCC TCTGTTTGTA ATCATTCCTC ACGGTGAACT TGATCGCGCT
GTTGTCCTGC CTGAAGTGGT GACAGTCGAT ATGCTGCTCA GGGAACTTGA ACGGGCGAGA
GAATAA
 
Protein sequence
MLLSCIALCL LACSSPLHAG LFGDDQPVEA EAFLVGNSSG EGLLAAVHLK IEEGWHVYWK 
NPGEAGMPVE IVWELPEGFR SLPVEYPFPE RFESGGITGF GYRDEVVLFS RIVPEKPSDN
RIFPRSGFPL KAEVSWLSCK EVCIPGSVSL DLEADSSSQA NRDLAAKFLG RIPLKGDASL
VVERIAAVRE DGSGFLEIDF SGPDARNVKD FFPLYPEKGI DIKGTTVQGG SVQLPLSAEE
APERLKGVVV TERSAYVVDA GVDSGEAVPA SSAVSGSLPV MLGLAFLGGI LLNVMPCVLP
VIGLKVFSLV GSDPLASGGH SRKTGRVHSL VFAAGVLLSF WVLAAFVWGL QGMGQQIGWG
FQFQSPVFVM FIAAIVFAFS LNLFGLFELG APVVSGKIGS VALHHDVLGS FVSGVLATTL
ATPCTAPFLG TALGFAFVQP VWVVFLFFTV IAVGMAAPYV ILAWHPAWLK LLPKPGHWMF
VFKQLMGFIL VAVVVWLAAI LNAQAGSDGM FSLLLLLFVI AFCLWIVGSL TAHGASMRKQ
LVVWAAALVF MTGAFLILIS DIGSGAKQGI EADRSGLSAA GDNNGALWQE FSPSLFEELL
EKKKTVFLEF TADWCITCKV LEASVLGNND VVEALHRPDV AAVRADWTSR DDAVTALMQR
FGRSGVPLFV IIPHGELDRA VVLPEVVTVD MLLRELERAR E
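
The gene sequence begins with TTG, while the protein sequence begins with M. That is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of the alternative initiation codons and is read as methionine when it starts a CDS. The following is a minimal sketch of such a translation, not necessarily how the database derived the protein; `translate_table11` and the constants are hypothetical names introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a complete CDS; whitespace between sequence blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid initiation codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_table11("TTGCTGTTAT CATGTATCGC ACTCTGTCTT"))  # MLLSCIALCL
```

The output matches the first ten residues of the protein sequence listed above.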