Gene Cpha266_2264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2264 
Symbol 
ID4568486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2594437 
End bp2595744 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content59% 
IMG OID639766826 
ProductHipA domain-containing protein 
Protein accessionYP_912680 
Protein GI119358036 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTACAA CAGCAAGGGT AAACCTGTGG GGGCGCACGA TCGGGGCTGT ATCGCTCGGT 
AGCGACGCTG CGACTGCAAC CTTCGAGTAC GATCCGGCCT TCGTTCAGAG CGGCATCGAA
ATCGCCCCGT TGACCATGCC GCTCTCCGGT CAGCTCTACT CCTTTCCCTC GCTGCGTCCC
GAAACCTTCC ATGGGCTTCC GGGACTGTTG GCGGATTCGC TGCCGGATCG GTTCGGCAAT
GCGCTGATCG ATGCCTGGCT GGCCCGTTCC GGTCGCACAC CCGGTTCCTT CAATGCCGTC
GAGAGGCTCT GTTATACGGG GTCTCGGGGC ATGGGCGCCC TTGAATATGC TCCAGCCATA
CGGTTGGGGG TCTCCGGCTC TGCGCCGGTC GAAATCGAAC GGTTGGTCGA GTTGGCTTCG
GAGGTGTTGA CCCATCGCAA CGATCTGCAG GTCTGGTTCC ACGATGAGGG CAAGGAGCTT
GCGCTCGGGG AGATTCTCCG GGTCGGCACC TCCGCGGGCG GAGCGAGAGC CAAGGCGGTG
ATTGCCTGGA ACCCGGAAAC CGACGAAGTT CGTTCAGGCC AGGTGAAGGC CGGAAAAGGG
TTCGAGTACT GGTTGCTCAA GTTCGACGGG GTGAGTGGCA ACAAGGACAA GGAGCTGGAA
GATCCAAAAG GGTACGGTGC AATCGAGTAC GCATACTACC GCATGGCGCT GGATGCGGGA
ATCACCATGA CGCCCTGCCG ACTGTTCGAG GAAAACGGTC GTCGCCATTT TATGACGAGG
CGCTTTGACC GGTTGGAGGA TGGAGGCAAA CTGCACATGC AGTCGCTCTG CGGCATAGCG
CATTACGACT TCAATCAGGC GGGAGCATAC GGGTATGAAC AGGCGATGCA GGTCATTCGA
CGCCTTGGTT TGCCGATGGC TTCCGTCGAG GAACAGTTCC GGCGAATGGT GTTCAATATC
GTGGCCCGCA ATCAGGATGA CCATGTGAAG AACATTGCCT TTCTGATGGA CAGGTCGGGC
AACTGGTCGC TTGCGCCAGC GTTCGATATT ACCTGGAGCT ATCAACCGGG GGGAGCGTGG
ACATCGACCC ATCAGATGAC GATGAACGGC AAACGGAGCG GATTCCTGCC GGACTATTTC
AAGGCATGTG CGAAAAGCGC ATCCATGAAA CGCGGGCGAG CCGAAACCAT CGTCGCTGAA
GTGCAGGACG TTGTTCGCAG ATGGCATGAT TATGCCGAGG AGTCGCGCGT CACTCCCCGA
CAACGGGATA AGATTGCAAC AACGCTGGGA CTGGAGGGCT TTGTATAA
 
Protein sequence
MSTTARVNLW GRTIGAVSLG SDAATATFEY DPAFVQSGIE IAPLTMPLSG QLYSFPSLRP 
ETFHGLPGLL ADSLPDRFGN ALIDAWLARS GRTPGSFNAV ERLCYTGSRG MGALEYAPAI
RLGVSGSAPV EIERLVELAS EVLTHRNDLQ VWFHDEGKEL ALGEILRVGT SAGGARAKAV
IAWNPETDEV RSGQVKAGKG FEYWLLKFDG VSGNKDKELE DPKGYGAIEY AYYRMALDAG
ITMTPCRLFE ENGRRHFMTR RFDRLEDGGK LHMQSLCGIA HYDFNQAGAY GYEQAMQVIR
RLGLPMASVE EQFRRMVFNI VARNQDDHVK NIAFLMDRSG NWSLAPAFDI TWSYQPGGAW
TSTHQMTMNG KRSGFLPDYF KACAKSASMK RGRAETIVAE VQDVVRRWHD YAEESRVTPR
QRDKIATTLG LEGFV