Gene Cpha266_2263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2263 
Symbol 
ID4568485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2592471 
End bp2594162 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content51% 
IMG OID639766825 
Productsulfatase 
Protein accessionYP_912679 
Protein GI119358035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTCC CTACTTGCAG GAGCGATGTG AACAAAGGCT CCCGTTCCGT TCGCCTTATT 
GCCACGTTTG CCATTGTACA ACTGGCGCTA CATCTCTATA CAGTCGTTGA ATCACCGGAC
CCGCGTTTGT TTTTGCCGCA TTTTATAGCA TGGAGCCATG ACCTGCTGAT ACTCTCGATT
TTGTTTTTCG TCTTCAGCAG AGCAATCGCC CTGTTTCCAT CGCGGTTTCG GAATTGCGCT
GAACTCATGA CTCTGCCGAT CATCGCTCTG GCGCTTCTGC CACTGACGCT CTATCCCCGG
ATGCTTCGCG AGTACCTCTC CTTTCCAGTG AACCTGTTTA CGGCAACCCC TGCATCAGCA
TCTGCGATGC TGACCAAATA TCTTGGGCTT TCGAAACTGA TGCCGGTAGC CTTTGCGGTA
GCTTCCGCTC TGGTTGCGCT GCTGATGCTG CCGTTTCCTT CGTGGTCGAA AAAGGGAAAG
CTCTTGTTCA CGGTTTTCCG GGGAATTCTG CTTACCGTTG CAATCCTGAC ACTGCCGCGT
TCACCCCATC CGGTTGTAAA CAGCCTTAAA GAAGAGATGT CGGCTGTACT TTCTCACGAA
CGACGGGAGG TGCCTGCGCT TTTTTCCGCA CCACACCGAC AGGATAATCA GAAGCCTTCA
GGTTCCGGCG TCTTGTCGTT GCAGGAAAAA CTGAAAGCGG ACCATATCTA TCTGATTGTG
CTCGAAGGGG TGAGTGCAGA TCAGTTCGAG AACGCAATTT CCGGTACAGA GTCAAGGTTT
TATCGTCGCA TCTCCAGACA TGCCAGATAT TTCGACCGGT ACTACACGAC CAATCTCGAC
TCCTATACCA GCCTGATCGC CATGCTCACA TCCGAGCAGG TCCCGTACCG TTCTTATACC
GATACCGGAT TGTACGATGC GGTCAACAAT GCTCCTAACC TTGCACGCAG TTTTAAAGAT
ATCGGATTCC ATACTCTTTT TATCAGCATC TACGACGATC AGCCGTTCAT CCCTGTTCGT
CGGGACTGGT CGAAAATCAT GCACCGACAT GACCTTCCTG CCGGAAAACA ATGGGTCTCC
GTTGAATCAA GCCGCATGGA GTCCGCAACA GAGGACAGGG CTGCGCTTTC GACGCTGGGA
AAGCTCCCCT CGCTGTATCC GAAGACTTTC GTTTTGCACG AACTGGCCTA TGGCCACACG
ACGGAGTGGC GGGCAAAGAC AGGTATTCCA CAACTTGCTT ATTACGATAC CTATCTGAAT
GAACTGCTTG ACCTGCTCAT TGCGAATGGA ACCTGGTCAA AAAGCCTTAT GGTAATCGTT
TCGGACCATG GCGACCGGGC GAAAGGAGCG AATACCGAAA GCTATCGTGT GCCGTTGATG
ATTGTTGGGC AGGATGTGGC GCAAGGCATC GATCATACGT TTCGCTCTCA TCTGGAGCTG
CAGCAGATCA TGGTATCATC GCTAACCGGA AACACCATGC CTGAGCCAAA AAAGGAGGCG
ATTGTTGTCG GTTCAACCGA GCGCTGGATA TATGGACTGA TCGATGCTCA CGGCGATCAT
CTGGTTATCG ATGATCGTAC CGGCAAAGTT GTCGCATCGA ATGGAAAATT GAGTTCAAAG
GCTGTTCACA ACAGATTTCA GGAAATAATC AACAATTTCG GAATGCGTTT TGGTCCGGAA
AACGAAAAAT AG
 
Protein sequence
MHLPTCRSDV NKGSRSVRLI ATFAIVQLAL HLYTVVESPD PRLFLPHFIA WSHDLLILSI 
LFFVFSRAIA LFPSRFRNCA ELMTLPIIAL ALLPLTLYPR MLREYLSFPV NLFTATPASA
SAMLTKYLGL SKLMPVAFAV ASALVALLML PFPSWSKKGK LLFTVFRGIL LTVAILTLPR
SPHPVVNSLK EEMSAVLSHE RREVPALFSA PHRQDNQKPS GSGVLSLQEK LKADHIYLIV
LEGVSADQFE NAISGTESRF YRRISRHARY FDRYYTTNLD SYTSLIAMLT SEQVPYRSYT
DTGLYDAVNN APNLARSFKD IGFHTLFISI YDDQPFIPVR RDWSKIMHRH DLPAGKQWVS
VESSRMESAT EDRAALSTLG KLPSLYPKTF VLHELAYGHT TEWRAKTGIP QLAYYDTYLN
ELLDLLIANG TWSKSLMVIV SDHGDRAKGA NTESYRVPLM IVGQDVAQGI DHTFRSHLEL
QQIMVSSLTG NTMPEPKKEA IVVGSTERWI YGLIDAHGDH LVIDDRTGKV VASNGKLSSK
AVHNRFQEII NNFGMRFGPE NEK