Gene Cphamn1_1595 details

Gene Information

Locus tag: Cphamn1_1595
Symbol:
ID: 6375273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1721221
End bp: 1722264
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 39%
IMG OID: 642684083
Product: protein of unknown function DUF21
Protein accession: YP_001959997
Protein GI: 189500527
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
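
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1721221, 1722264)
print(length)                           # 1044
print(expected_protein_length(length))  # 347
```

Both values match the Gene Length and Protein Length fields of this record.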


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.333457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCTCT TACTAGTATA TTTTTTTCTT GCTTTAGGTG TTTCTTTCAT GTGTTCAATA 
TTAGAGTCGG TTTTGCTTTC TACTAATATG TCATATGTTT CGATTCTTGA GAAAGAAAAG
CCTAACGTTG GCGCTCTTTT AAAACATCAC AAGGTTAACC TGAATAAATC CATTGCGTCT
ATTTTAATTC TAAACACAAT CGCCAACACG TTGGGGGCTG CGGGAGTGGG TGCACAAGCC
GCGCATATAT ATGGTTCTGA TATAGTCGTC TATGTTTCTG TTGTTCTTAC GTTCGGTATT
CTGTTTTTTT CTGAGATAAT ACCGAAAACA ATCGGCGCGC TTTACTGGAA AGAACTCTCA
ACGATAACCG CTTACGCGAT ACATGTATTT ATCTGGATTA CCTACCCGTT AATATACCTT
ACCTTGTTTG TGACGGATAA AATGTCGAAA AGCAAGAAAA ATGTAAGCAG GATGAGCAGG
GCAGAGCTGC TTGCATCGGC ATTATTAGGG GAACATGAAG GCGTTATCGA TGAAAAAGAG
TCGGATGTTA TCGAGAATGT CCTGAGGCTT GACGAAATCA AGGTCAAGGA TATATTGACT
CCGCGCAGTG TTGTTTTTGC AGTGGAAGAG AACCGGAGCA TAAAGGATAT CGTCTCAAAC
GATAGTGAAA TATTTAATTT CTCACGAGTG CCTTTATATA AGGAAAACAT GGACACTATT
GTGGGAATAG CGTTAACCAA GCAGATATTT GAGCAGGCCC TGAAGGATGA TAGTGTTTCG
ATTAAAGAGA TAAGCAATAC AATATTTCAT GTCAATGAAA ATGTTCCTGT ATCAAAAGCG
CTGGATCTTT TTGTCAAGAA AAAAGAGCAT ATGTTTTTAG TTGTTGATAA TTATGACCAG
ACAGAAGGTA TCGTTACTCT GGAAGATTGT ATCGAGACAC TGTTAGGAAT TGAAATTATG
GATGAAAGTG ATGATGTCGA AGATATGCGT GAGTGGGCAA AACTCAAGAT GAAGATAAAG
CGGAAGCAAA AGCAAAAAGA ATAG
 
Protein sequence
MTLLLVYFFL ALGVSFMCSI LESVLLSTNM SYVSILEKEK PNVGALLKHH KVNLNKSIAS 
ILILNTIANT LGAAGVGAQA AHIYGSDIVV YVSVVLTFGI LFFSEIIPKT IGALYWKELS
TITAYAIHVF IWITYPLIYL TLFVTDKMSK SKKNVSRMSR AELLASALLG EHEGVIDEKE
SDVIENVLRL DEIKVKDILT PRSVVFAVEE NRSIKDIVSN DSEIFNFSRV PLYKENMDTI
VGIALTKQIF EQALKDDSVS IKEISNTIFH VNENVPVSKA LDLFVKKKEH MFLVVDNYDQ
TEGIVTLEDC IETLLGIEIM DESDDVEDMR EWAKLKMKIK RKQKQKE
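
The relationship between the gene sequence and the protein sequence can be checked with a short script. The sketch below is illustrative only and is not the tool that generated this record. It assumes the sequence is pasted with the 10-base spacing shown above, and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGACTCTCT TACTAGTATA TTTTTTTCTT GCTTTAGGTG TTTCTTTCAT GTGTTCAATA"
print(translate(first_line))  # MTLLLVYFFLALGVSFMCSI
```

Translating the first 60 bases reproduces the first 20 residues of the protein sequence. Running `translate` and `gc_fraction` on the full gene sequence allows the result to be compared with the listed protein sequence and GC content.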