Gene Cpha266_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2189 
Symbol 
ID4571001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2525166 
End bp2526227 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content50% 
IMG OID639766762 
Producthypothetical protein 
Protein accessionYP_912616 
Protein GI119357972 
COG category[R] General function prediction only 
COG ID[COG3943] Virulence protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATGA GAAAAAAAAC AGGTAAGGAG GTCTCTATTG TTCGCTCCTC GGCGGCTGAA 
TACCTGACCT TTATCGCTGC CAGCGGCACA GGGGGCGTCG ATGCGGTTTA TGCCGATGAA
AATATCTGGC TTACCCAGAA GATGATGGGG GTGCTCTATG ACGTGGCGAC TCATACCATA
AACTATCACC TGAAAAAAGT TTTTTCAGAC AGCGAATTAC AGGAGGACTC AGTTATTCGA
AATTTTCGAA TAACTGCCCG AGACGGAAAA AACTACAACA CCAAACATTA CAGTCTTGCT
GCTGTCATTG CCGTCGGTTA CAAGGTAAAT TCCGAACGTG CAGTACAATT CCGCAAGTGG
GCGACCACTA TCATTAAGGA GTTCACCATC AAGGGCTATG CCATGGATGA CGAACGACTG
AAAAGCGGTG GCTCCATCCT TACCGACCAG TATTTTGAAG AGCAGTTGCA GCGTATTCGG
GAGATTCGCT TGAGTGAACG CAAGTTCTAC CAGAAGGTCA CCGACATCTA TGCAACCTCC
ATCGATTACG ACGTGACAGC CCAGGCTACC AAGCGCTTTT TCGCTACCGT GCAGAATAAA
CTGCACTGGG CAATACATGC AGAGACCGCA GCGGAGGTTA TCTATAACCG GGCCGATGCC
GAAAAACAGA ATATGGGGTT GACCACCTGG AAGGATGCTC CCGGAGGAAA GATCCAGAAG
TTCGACGTTG TGGTCGCAAA GAACTACCTG ACCGAACATG AAATAGCACA ACTTTCACGG
TTGGTTTCGG CATACCTGGA TGTTGCAGAG GACATGGCGC TACGCAAGAT GCCCATGACC
ATGCAGGACT GGGAAACCCG CCTCAATCGC TTCATCGCAG CGACTGATCG TGAAATTCTT
CAGGATCCGG GCAAAGTGAC TGCAGAAATT GCCAAAGCTC ATGCCGAAAG TGAGTTTGAA
AAGTACCGCA TCGTTCAGGA CAGGCTATTC GAAAGCGACT TCGACAGAAT GGTCAAGGAG
ATCGAGTCTC TGCAGAAGCC GAAGGGAGGG GGTGATGAGT AG
 
Protein sequence
MSMRKKTGKE VSIVRSSAAE YLTFIAASGT GGVDAVYADE NIWLTQKMMG VLYDVATHTI 
NYHLKKVFSD SELQEDSVIR NFRITARDGK NYNTKHYSLA AVIAVGYKVN SERAVQFRKW
ATTIIKEFTI KGYAMDDERL KSGGSILTDQ YFEEQLQRIR EIRLSERKFY QKVTDIYATS
IDYDVTAQAT KRFFATVQNK LHWAIHAETA AEVIYNRADA EKQNMGLTTW KDAPGGKIQK
FDVVVAKNYL TEHEIAQLSR LVSAYLDVAE DMALRKMPMT MQDWETRLNR FIAATDREIL
QDPGKVTAEI AKAHAESEFE KYRIVQDRLF ESDFDRMVKE IESLQKPKGG GDE