Gene Cphamn1_0347 details

Gene Information

Locus tag: Cphamn1_0347
Symbol:
ID: 6374008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 359303
End bp: 360565
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 53%
IMG OID: 642682866
Product: hypothetical protein
Protein accession: YP_001958796
Protein GI: 189499326
COG category: [R] General function prediction only
COG ID: [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif
TIGRFAM ID:
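
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 359303
end_bp = 360565
protein_length_aa = 420

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one 3 bp codon per residue, plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)   # 1263
print(expected_cds_bp)  # 1263
```

Both values agree with the listed gene length of 1263 bp.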


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGTAC TGGATAAATT AGAGATACTT TCCGGTTCGG CCCGGTACGA TGCTTCCTGT 
GCTTCGAGCG GCAGCAGCAG AGGCGGCGGA GGTGAGGGTA CGCTCGGCAG CACGTCAAAA
GGAGGGATCT GTCATTCATG GTCTGATGAC GGTCGCTGTA TCTCCCTGCT TAAAGTTCTT
TATTCCAATG ACTGCAGCTA TGATTGCGTC TATTGCGTCA ACCGCAGATC AAATCCTCAT
CCCCGAACGT CGTTTACCGT GCATGAACTG GTTGAACTGA CGATCAGATT CTATCGCAGA
AACTATATCG AGGGCCTTTT TCTGAGCTCC GCTGTCATGC AAAGCCCTGA ATCTACAATG
GAGAGCATGG TCGAGGTGAT CCGGAAACTT CGGGTAGAGG AGTCCTTTGC CGGTTACATA
CACATGAAAG TCATTCCGGG ATGCTCTGAG GATCTGGTTC GAAAGGCAGG ATTCTATGCC
GATCGTCTCA GTGTCAATAT CGAGCTCCCT TCCGGTGATT CATTGAAGCT GCTCGCGCCG
CAGAAACAGA GAGAGGATAT TCTCAAGCCG ATGGCTTGCC TCGGCGACGC GATCATCGCA
AGCCGTAAAG AGAGAAAGAA GAACCGCAAG GCACCCGCTT TTTCTCCCGC AGGCCAGAGC
ACGCAGATGA TCATCGGTGC TTCTCCCGAG TCGGATTTCA GGATTCTGAG CCTTTCGCAG
GGGCTCTATA AACAAATGCA TCTCAAACGG GTCTACTATT CCGCGTTTGT TCCTGTGAAC
AGTGACAACC TTCTGCCGGT TCATGCAAAA CCGCCTTTGC AGCGCGAGCA TCGTCTCTAT
CAGGCCGACT GGCTGCTCCG CAACTACGGC TTCACGGCTG ACGAGATCCT TTCGGAAGAG
TCTCCTTTCC TTGATGAGCA TCTTGATCCT AAAGCGTCAT GGGCGTTACG CAATCCCGGG
TTTTTTCCGG TCGATGTCAA CAGGGATGGT TACTTCGCGT TGCTGAGGGT TCCGGGAATC
GGTGTGACCT CCGCAAAGCG GATTGTTGCA GCGCGAAGGT TTGCGGTGAT AACTCCGGAG
GGGCTGAAGA ATATCGGGGT CGTCATGAAG CGGGCAAGAT ACTTTATCAC CTGTTCCGGC
AGGCCGGTGG AGCGTTTGTT CGACAGACCG GCACTTGTTC GCCGGAAACT GCTCATCGCT
GAAACCGGAA AAGACCCCCG CGCACTGAAG CAGCGGCAGC TTGATTTTTT TAGTAACAAG
TAA
 
Protein sequence
MDVLDKLEIL SGSARYDASC ASSGSSRGGG GEGTLGSTSK GGICHSWSDD GRCISLLKVL 
YSNDCSYDCV YCVNRRSNPH PRTSFTVHEL VELTIRFYRR NYIEGLFLSS AVMQSPESTM
ESMVEVIRKL RVEESFAGYI HMKVIPGCSE DLVRKAGFYA DRLSVNIELP SGDSLKLLAP
QKQREDILKP MACLGDAIIA SRKERKKNRK APAFSPAGQS TQMIIGASPE SDFRILSLSQ
GLYKQMHLKR VYYSAFVPVN SDNLLPVHAK PPLQREHRLY QADWLLRNYG FTADEILSEE
SPFLDEHLDP KASWALRNPG FFPVDVNRDG YFALLRVPGI GVTSAKRIVA ARRFAVITPE
GLKNIGVVMK RARYFITCSG RPVERLFDRP ALVRRKLLIA ETGKDPRALK QRQLDFFSNK
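
The gene and protein sequences above are related by translation table 11, whose codon-to-amino-acid assignments match the standard genetic code. The following is a minimal sketch of how the two could be compared and how a GC fraction could be computed from the gene sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of the record, and the sketch assumes the sequence contains only A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGATGTAC TGGATAAATT AGAGATACTT"
print(translate(first_blocks))  # MDVLDKLEIL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the complete protein sequence and the listed GC content to be checked in the same way.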