Gene Cpha266_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1601 
Symbol 
ID4571124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1816695 
End bp1817939 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content58% 
IMG OID639766182 
Producthypothetical protein 
Protein accessionYP_912046 
Protein GI119357402 
COG category 
COG ID 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000118956 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGTTTG TGATTGATCG GCGGAGGATG TTGCCTCATC AGCGGGCATT CTGGGAGTTG 
CCGAATTTTC TGAAGGTGCT GGTTGGAGGG TATGGGTGCG GGAAGACGCA CATTGGGGCG
TTGCGGTCGA TTTATGATAG TTATGTGAAT GCGCCGGTGC CGCATTTGTA TGTGTCGCCG
TCATACAAGC AGGCTCGGAA GACAGTGGTG ATTTCGATTC GGGAGTTGCT GGACGCGGCG
GGTGTGCGGT ATCGGTTCAA TAAGACGAAT CATGAGTTTG CGATTGCGAA TTGGAATGGG
ACGATCTGGA TTGCGAGCGG TGATGAGCCT GACAGTTTGA AGGGTCCGAA CATCGGGAGC
GCGGGGATCG ATGAGCCGTT CATCCAGCAG AAGGAGGTGT TTGATATTAC GCTGTCGCGG
GTGCGGCATC CGAGGGCGAA ACATCGGGAG ATTTTTCTGA CGGGGACGCC GGAGCAGTTG
AATTGGGGGC ATGAGGTTTC GCAGAATGAT GAGGGTCGGT ATGATCTGGG GCTGGTGGTT
GGTCGGACGG CGGATAATGT GCATTTGCCG GGTCAGTTCG TTTCGATGCT TGAGCGGGCG
TATGATGAGA ATCAGCGGGC TGCGTATATG AACGGGTTGT TTGTGAACCT GACGGTTGGC
AGGGTGTACA GTTATTTCGA TCGGTCGGTG CATATGGGCG GGGCTGGCCT GGGTGGTGAT
GGTGCGGATG GCGAAGTGGT GGCGGGGATT GATTTCAACG TGGATCATTT GACGGCGGTG
GTGTTGCGGG TGTGGGGTGA CCGGGTGCAT TGTTTCGATG AGATGGTGTT GCGTGGTTCG
ACGACGTATG AGCTGGCGGA TCGGCTGTAT GAGCGGTTTC CGGGGATTCG GGTGTTTCCG
GATCCGTCGG GCGGGGCGCG GCGGACGTCG GCTCCGAAGA CGGATGTGCG GATTCTGCAG
GATAAGGGGT TCAGGGTGGA GATGCGGCCG AAGCAGCCGC CGGTGAAGGA CAGGGTGCAT
GCGGTGCAGA AGTTGTTGCG GGAGGGTCGG TTGTCGGTGA CGGGGTGCGC GTGTCTGGTT
CGTGATTTTG AGCAGGTGGT GTGGCGCGGG GGTGATATTG ATAAGGTGAC GAGGCCGGAG
TTGACGCATG CCTCGGATGC GGTGGGGTAT GCGATTGAGA AGTTGTTCCC TGTTCCGCTG
CCGGAGCGGG ATTATTGGCG GCAGCCGGAG CATTGGAGGG CTTAG
 
Protein sequence
MRFVIDRRRM LPHQRAFWEL PNFLKVLVGG YGCGKTHIGA LRSIYDSYVN APVPHLYVSP 
SYKQARKTVV ISIRELLDAA GVRYRFNKTN HEFAIANWNG TIWIASGDEP DSLKGPNIGS
AGIDEPFIQQ KEVFDITLSR VRHPRAKHRE IFLTGTPEQL NWGHEVSQND EGRYDLGLVV
GRTADNVHLP GQFVSMLERA YDENQRAAYM NGLFVNLTVG RVYSYFDRSV HMGGAGLGGD
GADGEVVAGI DFNVDHLTAV VLRVWGDRVH CFDEMVLRGS TTYELADRLY ERFPGIRVFP
DPSGGARRTS APKTDVRILQ DKGFRVEMRP KQPPVKDRVH AVQKLLREGR LSVTGCACLV
RDFEQVVWRG GDIDKVTRPE LTHASDAVGY AIEKLFPVPL PERDYWRQPE HWRA