Gene Cphamn1_0519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0519 
Symbol 
ID6374183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp544952 
End bp546019 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content50% 
IMG OID642683036 
Productprotein of unknown function DUF900 hydrolase family protein 
Protein accessionYP_001958963 
Protein GI189499493 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATTCC GGAAATTTAA CGGTATCTGT TTTGTCGCGC TGCTCTCTTT TTTTCTCGGT 
GGCTGCGCCT CAACCCTCGT GGGTTCCCTG GAAGAACTTC ACGAACCGCC TGTAGAAGCG
TTTTTCGTAA CCGATCGCAA CGATACCGGT TTAAAGGATC CCGCAGAGAA ATACGGTAAA
GAGCGCGCTT CGGTATCTTA CGGGATCTGC AGTGTATCCA TCCCTCCCGG TCATCGCATC
GGGAAACTCG AAAGTCCCAC ATTCAGAAAG GACGTTGAGG AGCATATCGT GCTTGTGGAT
GTGTCCGTTC TTGAAAAAAA AGATTTTTTT TCGAAGGTTT CTCATGCACT GAACCGTTCT
GGCAAGAAGA CTATGCTTCT TTACGTGCAC GGTTATAATG TGACGTTTGA AAAGGCGGCC
AGAAGAATGG TTCAGATTGT CGATGATCTT GATTTTAAGG GCATTCCGGT TTTCTACAGT
TGGCCGTCTC AGGGAAGTGT CGGAGGATAT CCTGCTGATG CAGCCAGTGT CGAATGGTCG
GAACAGAACC TTGGGGATTT TCTTGCGGAA GCTGCCCGGA TTTCGGGCGT AAATACCTTG
TATCTTTTGG CCCACAGTAT GGGAAATCGT GCCTTGACTG GCGCTTTCCT GGATCTTGTC
AGGGAAAAAC CGCATTTAAA AAGCCGTTTC AAGGCGCTGC TCCTGACCGC TCCGGATATT
GATTCCGAGG TTTTCAGAAG AGATATCGGG CCGGGTCTCG CGGCCTCAGG GGCTGCGATT
ACCCTTTACG CATCAGGCAG GGACAGGGCA TTGAGGCTCT CGAAAAGACT TCACGGATAT
CCAAGGGCCG GGGATGTAGA CGGTTTTCCC CTGATCGTTC CCGGTATTGA GACGGTAGAC
GCTACCCATG TGGATACAAG TTTTCTCGGG CACTCCTATT TCAACGGTTC GAGATCTGTA
TTGTCGGATA TGTTCTATAT TCTCAATGAG GAGCTTCGGG CGGAACAACG GTTTTCACTT
GAACCCGTTG ATACGCCTGA GGGGCGGTAC TGGAGATTCA AGGAGTAG
 
Protein sequence
MIFRKFNGIC FVALLSFFLG GCASTLVGSL EELHEPPVEA FFVTDRNDTG LKDPAEKYGK 
ERASVSYGIC SVSIPPGHRI GKLESPTFRK DVEEHIVLVD VSVLEKKDFF SKVSHALNRS
GKKTMLLYVH GYNVTFEKAA RRMVQIVDDL DFKGIPVFYS WPSQGSVGGY PADAASVEWS
EQNLGDFLAE AARISGVNTL YLLAHSMGNR ALTGAFLDLV REKPHLKSRF KALLLTAPDI
DSEVFRRDIG PGLAASGAAI TLYASGRDRA LRLSKRLHGY PRAGDVDGFP LIVPGIETVD
ATHVDTSFLG HSYFNGSRSV LSDMFYILNE ELRAEQRFSL EPVDTPEGRY WRFKE