Gene Cphamn1_1163 details


Gene Information

Locus tag: Cphamn1_1163
Symbol:
ID: 6374838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1251046
End bp: 1252185
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 52%
IMG OID: 642683662
Product: glucose sorbosone dehydrogenase
Protein accession: YP_001959579
Protein GI: 189500109
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
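
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the record above.
START_BP = 1251046
END_BP = 1252185
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Both checks agree with the record: 1252185 - 1251046 + 1 = 1140 bp, and 1140 / 3 - 1 = 379 aa.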


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00834081
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGCTAT TACCCTTAAA GAGTTGTCTG TCATTGTTTC TGGTCATGAT TTCCTGCAAT 
GCTTGTGCCG AGCCTGTTAG TAACAGGATT GAGGGGAATG CGGGCTCAAG TCTCCTATCC
GAGAGTTTTG GGACATTCAA TGAACCCTGG GCCATGACTT TTCTTCCGGA TGGTGATCTC
CTGGTAACTG AAAAAGGAGG AACTCTGCTT CTGGTCAAAC CGGATGACCG TTCCAGGGTT
CCGGTCCAGG GTGTACCCGA GGTGGCTTAC GGTGGTCAGG GCGGCCTTGG TGATATAATT
CTGCACCCTC GATATATGGA TAACAACTGG ATTTACCTGT CATATGTCGA GCAGGATGCT
TCAGGAAAGA GGGGAGCAGC AGTTGCCCGG GCTCGATTTC GGCCAGCATC GGCCGGGGCA
AAACTGGAAA ACCTCGAAGT CATCTGGCGG CAAGAGCCTA AAGTATCTGG CAACGGGCAT
TACTCATACA GGTTGGCATT CAGCCCGGAT GGGCATCTTT TTATCACTTC CGGTGATCGT
CAAAAACAGA CGCCGGCACA GAGCTGGAGA CAAAGTCTTG GCAAGGTGAT CAGGTTAAAC
GCGGATGGAT CAGTCCCACC GGACAATCCG TTTCAGGATA AAGGAGAACT TGCAAAAACC
TTCTGGTCTC TCGGGCATCG AAATCTTCTT GGGATTGCAT TTGACAAAAA GGGGCAACTG
TGGACACACG AAATGGGCCC CAGGAACGGT GATGAATTCA ACCTGACCAT TGGTGGAGAT
AATTACGGTT GGCCCATTGT ATCATGGGGC GATCACTACT CCGGTCTTCC AATTCCCGAT
CATGATACGC GTCCGGAATT CAATGCACCG GAGATATACT GGGTACCAAC CGTTGCACCT
TCAGGCCTGA TACTCTACTC CGGTTCTCTG TTTCCGGCGT GGCAGGGCAA TGCCTTTATC
GGAGGGCTCA GATCACAATC ACTGATACGG ATCAGGATTG AAGGGAACCA GGCGGAGGAG
GTCGAACGTT TTTCCATGGG AAAACGAATA CGGGAGGTTG AGCAAGGACC AGACGGCGCC
ATATGGGTTC TGGAGGATAC TAATGGTGGC CGCCTGATCC GCCTCAGCGA ATACAAGTAG
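
The GC content field can be recomputed from a gene sequence laid out in space-separated blocks of ten bases, as above. This is a minimal sketch and the helper name gc_content is hypothetical. It is shown on the first block only; the result for the full sequence is not asserted here.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of the gene sequence above: 5 of its 10 bases are G or C.
print(f"{gc_content('ATGCGGCTAT'):.1f}")  # 50.0
```

Passing the complete gene sequence (all lines joined) to the same function gives a value that can be compared with the GC content field of the record.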
 
Protein sequence
MRLLPLKSCL SLFLVMISCN ACAEPVSNRI EGNAGSSLLS ESFGTFNEPW AMTFLPDGDL 
LVTEKGGTLL LVKPDDRSRV PVQGVPEVAY GGQGGLGDII LHPRYMDNNW IYLSYVEQDA
SGKRGAAVAR ARFRPASAGA KLENLEVIWR QEPKVSGNGH YSYRLAFSPD GHLFITSGDR
QKQTPAQSWR QSLGKVIRLN ADGSVPPDNP FQDKGELAKT FWSLGHRNLL GIAFDKKGQL
WTHEMGPRNG DEFNLTIGGD NYGWPIVSWG DHYSGLPIPD HDTRPEFNAP EIYWVPTVAP
SGLILYSGSL FPAWQGNAFI GGLRSQSLIR IRIEGNQAEE VERFSMGKRI REVEQGPDGA
IWVLEDTNGG RLIRLSEYK
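
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acids to internal codons as the standard code and differs only in which codons may act as start codons, so the same table is used here. The function name translate is hypothetical, and only the first three blocks of the gene sequence are translated.

```python
from itertools import product

# Amino acids for all 64 codons, in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGGCTAT TACCCTTAAA GAGTTGTCTG"))  # MRLLPLKSCL
```

The output matches the first block of the protein sequence, MRLLPLKSCL.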