Gene Cphamn1_2052 details

Gene Information

Locus tag: Cphamn1_2052
Symbol:
ID: 6375745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2214369
End bp: 2215580
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 44%
IMG OID: 642684543
Product: hypothetical protein
Protein accession: YP_001960443
Protein GI: 189500973
COG category:
COG ID:
TIGRFAM ID:
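
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and that the reported protein length excludes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2214369, 2215580)
print(length)                  # 1212
print(protein_length(length))  # 403
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.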


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000379131
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0792841
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATAA CCGAAAACCC GAACTGGCGA ACCCTTCATA AAGACAGTAA CAACGGGTAT 
CAGAAAGTCG TTAAACGTCT CGGAAATGAC GTTATTCTCT ATAGCATAGA AACAGATCAT
GACATTTCTC TTGACACCAT GAACATCGAC ATGCTTCAGA CTGTACTGCA GGATTCGAAG
ATCGGGAACA AGCCTGTCTG CCTCTTATGG AACATGAAGC ACATTACCAA TATTTCGCTG
ACGTATAAAA AACAGATTGC CAACCTTATC TATAACCGCA GAGTACACTT CGGCATTGTC
GTCTTCTTTA ACGTGGAGCC CGTGTGCATG ACACTGGTCC AAACGTTTGC CGCAATGGTA
CCTGAAGATA TGACGGTACT CATCAAGCAA AACTATACCG AAGCTGTAAA CACGACTTTG
GCCTGGAAAG AAGGATTACC TGTTGACACA ATCTATGAAA GTGCCGAGGA AGAGAAATAC
GAACTTCAGA AAAATGAGTT TCTTGCTGCA CTTGCCAGAA TTTCCTGGCT TGACATGATG
GAACAGAGCA TTCCCATGCC GTCAAACGAC GACAAACTGC TCCCCTTTTT CCAGGCGATC
AGTCATCTTC AGAGCGATCT TCTGGAAATA TCACGCAATA AGGAACAGGA ACTGAGACAG
ATTGAACAGG ACGGCGAAAA AACTCTTACC GAAAAAAATA TTCTGCTCAA CGCACAGAAG
GAACTGTATA AAAAGCTCAA AAATCAGCTG GAAAAGGAAA AATCGGCACT CACAGCAAGA
ATCGCCACGC AGGAGATGGA GCTTACGAGG ATATCTACAG CGGTTGTTGA AAAAACATCG
GCCCTTCGCC AGCTTCTCGA CCTGATCACC ACGCTGGATA TCGACCAGAG TCAAAAAAGA
ACCATGATCG ATATATGCTC AAACATGATC GACACCGAGC TTATAGAAAA GAGACTCAAT
ATTGAGCTTA CGACAACTGA TTCCGAGTTC CTGTCAAAAC TCCAGAAAAA ACACCCTAAC
CTGAACCAGC GGGAACTACG AATCTGCCTG CTGATAAAGC TGAATTACAA CACAAGGGAT
ATCGCGCGTT CAGTGGGTAT TTCTACCCGG GGAATGGAAA GCATTCGCTA CAGAATGCAC
AAAAAAGTAG GGCTGTCAAA ACACCAGTCC CTTAAAAGCT ATCTCACTGA ACTGATCATG
CAGAGAGACT GA
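
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-content calculation and a rough sanity check for a coding sequence. All function names are hypothetical. The start-codon check assumes ATG, although translation table 11 also permits alternative start codons.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start (assumed), stop codon at end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First and last lines of the gene sequence, as printed in this record.
first_line = "ATGGAAATAA CCGAAAACCC GAACTGGCGA ACCCTTCATA AAGACAGTAA CAACGGGTAT"
last_line = "CAGAGAGACT GA"
print(clean_sequence(first_line)[:3])  # ATG
print(clean_sequence(last_line)[-3:])  # TGA
```

Passing the full joined sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.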
 
Protein sequence
MEITENPNWR TLHKDSNNGY QKVVKRLGND VILYSIETDH DISLDTMNID MLQTVLQDSK 
IGNKPVCLLW NMKHITNISL TYKKQIANLI YNRRVHFGIV VFFNVEPVCM TLVQTFAAMV
PEDMTVLIKQ NYTEAVNTTL AWKEGLPVDT IYESAEEEKY ELQKNEFLAA LARISWLDMM
EQSIPMPSND DKLLPFFQAI SHLQSDLLEI SRNKEQELRQ IEQDGEKTLT EKNILLNAQK
ELYKKLKNQL EKEKSALTAR IATQEMELTR ISTAVVEKTS ALRQLLDLIT TLDIDQSQKR
TMIDICSNMI DTELIEKRLN IELTTTDSEF LSKLQKKHPN LNQRELRICL LIKLNYNTRD
IARSVGISTR GMESIRYRMH KKVGLSKHQS LKSYLTELIM QRD
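
The protein sequence should correspond to a translation of the gene sequence under translation table 11. The sketch below is one way to check this. It builds the codon table from the standard amino acid assignments, which table 11 shares with the standard code (the two differ in their permitted start codons, not in how internal codons are read). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order with their standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGAAATAA CCGAAAACCC GAACTGGCGA"))  # MEITENPNWR
```

The output matches the first ten residues of the protein sequence listed above.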