Gene Cpha266_2226 details

Gene Information

Locus tag: Cpha266_2226
Symbol:
ID: 4569448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2550665
End bp: 2551948
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 49%
IMG OID: 639766794
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_912648
Protein GI: 119358004
COG category:
COG ID:
TIGRFAM ID:
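
The coordinate and length fields in this record are mutually consistent, which the minimal Python sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2550665
end_bp = 2551948

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.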


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.820995
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACAACG AAACAGAAAG ACTCAACGCT TATCGGCAGA TGCGATCGAG TATCCGTAAT 
TCCGGGCAGT ATCTGATAGT GGGTATCGAT ATTGCCAAAG AGAAACATCA TGCATTTTTT
GGTACTGCCG CCGGTAAAGT CTTGTGCAAG CAACTGATCT TTACCAATGA CAAAAGCGGT
TTTGAACTGC TTATAGCACG AGCCGAGCAG CTCAGATTGC AGCATCAGCT AACGTATGTG
GTGTTCGGCA TGGAACCTAC AGCCAATTAT CATAAGCCAT TAGCTGAATA CCTTATTCAA
GAGGATGCGA TGGTGGTGCA GGTTTCCGGT ACGGCCGTTG TACGAAACCG GGAGTTGCTC
GATAATCGCT GGGATAAGCA TGATCGCAAG GATGCCGCCA ATGTGGCCGA TCTGGTGGGG
GCGGGAAAGT GCCAGTTTTA CGACAATCCG CCACAGGCGA TTCATGATCT TCGGGAGTTG
CTTAGCCTGA GGCGTCGGTA CCGTAAACTG GAATCCGGAA TCAGAACCCG TATCAGGAAC
AACCTGCTTG CGTTGTACTT CCCGGAGCTG GATTGCCGGT TCACTTCCTT CCAGCAGGAT
TGCCTGACCA TTATCAAGAC CTGCCTCTCA CCGGCAGCGA TTGCGGCAAT GCCCTTCGAG
GAGTTTAAGC GGCGGATCGT CATCCGGCAA AAAGGAAAAC GACAGGAATC GTTTCTGGAG
GATATCTGGA ACTCCGCTCA CCACACCGTC GGCAGGCCGG TCGATGAGAC CGTACAATAC
ATGGCTGCGC AGTCGGTAAG TCAGCTTGAA CACTTCAGAG CGGAAATCGA CAATCTTGAC
CGGCAGATTT TCATGATCGC CTCTTCTTTG CCAGAGTACA AGTATCTGAT CAGTATTCCG
GGATTCGGGC CGTTCATCAG CGCAAAACTT CTTGCCACTA TCAATGATCC GGATCGATTC
TCCAATGAGG CACAGGTGAT TAAACTGGCC GGTTTTGATC TCTGTGCCTC ACGAAGTGGC
AAGCCATCAG GTAAAGCGAT TCCTCAGATA TCCAAGAAAG GCAATGCTGA ACTGCGCTTT
GCCTTGGTAC AGGCGGCTAT TGTTGCCACC ACAAGAAATA CCCTGTTCAT CAGGTACCTG
AACCAGAAAC TGCAAGGACG AGAACAGGAA AAAGGAATTC TGAAGAAAAT GCGAACCAAG
GTAGCCTCAA AACTCCTTGT CATCGCTTGG ACACTGATGA AACAACACGA GTATTTTAAT
GGTGAACATT TGAGACTCAC ATAA
 
Protein sequence
MYNETERLNA YRQMRSSIRN SGQYLIVGID IAKEKHHAFF GTAAGKVLCK QLIFTNDKSG 
FELLIARAEQ LRLQHQLTYV VFGMEPTANY HKPLAEYLIQ EDAMVVQVSG TAVVRNRELL
DNRWDKHDRK DAANVADLVG AGKCQFYDNP PQAIHDLREL LSLRRRYRKL ESGIRTRIRN
NLLALYFPEL DCRFTSFQQD CLTIIKTCLS PAAIAAMPFE EFKRRIVIRQ KGKRQESFLE
DIWNSAHHTV GRPVDETVQY MAAQSVSQLE HFRAEIDNLD RQIFMIASSL PEYKYLISIP
GFGPFISAKL LATINDPDRF SNEAQVIKLA GFDLCASRSG KPSGKAIPQI SKKGNAELRF
ALVQAAIVAT TRNTLFIRYL NQKLQGREQE KGILKKMRTK VASKLLVIAW TLMKQHEYFN
GEHLRLT
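
The relation between the gene sequence and the protein sequence can be sketched in plain Python. This is an illustrative sketch, not the tool that produced the record. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which codons may act as start codons, which is not relevant here because the gene begins with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in T, C, A, G base order.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGTACAACG AAACAGAAAG ACTCAACGCT TATCGGCAGA TGCGATCGAG TATCCGTAAT"
last_row = "GGTGAACATT TGAGACTCAC ATAA"

print(translate(first_row))  # MYNETERLNAYRQMRSSIRN
print(translate(last_row))   # GEHLRLT
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives the value that the GC content field reports as a rounded percentage.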