Gene Cpha266_0254 details

Gene Information

Locus tag: Cpha266_0254
Symbol:
ID: 4568960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 283850
End bp: 285070
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 50%
IMG OID: 639764857
Product: transposase, IS204/IS1001/IS1096/IS1165 family protein
Protein accession: YP_910744
Protein GI: 119356100
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
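
The reported lengths follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record).

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 283850
end_bp = 285070

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # assumption: terminal stop codon is not translated

print(gene_length, protein_length)    # 1221 406
```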


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.151217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACC TTACCCTTTT TCAGATGGCC TTGGGACTTG AGTCCCCATG GTATGTATCG 
TCCTCATCAT TTGATGTCGA CCAAAAGCGC TTGGACATAC GAATAGATTT TAAACCGGGC
AGCACCTTCT GTTGTCCTCA ATGCGGCCGA GAAGGCGTGA AGGCCTATGA TACCTCCGAG
GCAACATGGC GCCATCTCAA CTTCTTTCAG CATGAGGCCT ACCTGACAGT TCGAGTGCCT
CGTATATCCT GCCCTGAGTG CGGCATTCTC AAGCTGCAAT CATTTCCCTG GTCTCGCCGT
GAGAGCGGCT TTACTCTGCT GTTTGAGGCG ATGATCATGA TCATGGCGAA GTCAATGCCG
GTCAAGGCAA TAGCCGCCAT TGTCGGCGAG CATGACACCC GTATCTGGCG GATCATCAAC
CACTATGTCG AAAAAGCCCG AGAGCAGGAG GATCACTCGG CAGTCACCAT GGTAGGTGTT
GATGAAACCT CCAGCAAGCG CGGTCATAAC TATGTGTCGC TGTTCGTTGA CCTTGCCGTA
TCGAAAGTGT TGTTTGCCAC TGAAGGGAAA GATGCAGCAA CGGTCAAGCG ATTCAGTGAA
GATCTTGCCG CCCATAAGGG TGATCCGGCA TTGATCACCG AATTCTGCAG CGACATGTCA
CCGGCATTCA TCAAAGGGGT CGCCGATAAC TTTACCAATG CCCAACTGAC CTTTGACAAG
TTCCATATCA TGCAGGTCAT TAATAATGCT GTCGATGAAG TGCGGCGTCA GGAGCAAAAA
GAGCGCCCTG AATTGCAGAG AAGCCGGTAC ATCTGGCTGA AAAACCAGAA CAACCTGAAG
GCTTCACAAC GCAAACGCCT TGATGAGTTA TCCTTGCCCC GACTGAATCT GAAAACAACT
CGAGCATACC GCATGCGACT AACTTTTCAG GAGTTTTTCG AGCAACCTCA GGTATTGGTG
GAAGCATTTC TGAAGAAGTG GTATTTCTGG GCAACCCACA GCCAGCTGCA GCCAATGAAA
GAGGCGGCTT ACACCATCAA ACGACACTGG TCTGGCATTC TGCGATGGTT CACTTCTCGT
ATCAATAATG GGGTACTTGA GGGAATCAAT AGCCTCATCC AGGCGGCCAA AGCACGGGCA
CGGGGTTACC GTACTACCAA AAATCTCATC AATATGATCT ACCTGATCAG CGGGAAGCTT
AATTTTGGCT TACCCACTTG A
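
A GC content figure like the one in the record can be recomputed from the blocked sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in 10-base blocks; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the last line of the gene sequence is used as a short demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Last line of the gene sequence above, used only as a short demonstration.
tail = clean_sequence("AATTTTGGCT TACCCACTTG A")
print(len(tail), gc_fraction(tail))
```

Passing all 21 sequence lines joined together to `clean_sequence` would give the full 1221 bp gene for the same calculation.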
 
Protein sequence
MNDLTLFQMA LGLESPWYVS SSSFDVDQKR LDIRIDFKPG STFCCPQCGR EGVKAYDTSE 
ATWRHLNFFQ HEAYLTVRVP RISCPECGIL KLQSFPWSRR ESGFTLLFEA MIMIMAKSMP
VKAIAAIVGE HDTRIWRIIN HYVEKAREQE DHSAVTMVGV DETSSKRGHN YVSLFVDLAV
SKVLFATEGK DAATVKRFSE DLAAHKGDPA LITEFCSDMS PAFIKGVADN FTNAQLTFDK
FHIMQVINNA VDEVRRQEQK ERPELQRSRY IWLKNQNNLK ASQRKRLDEL SLPRLNLKTT
RAYRMRLTFQ EFFEQPQVLV EAFLKKWYFW ATHSQLQPMK EAAYTIKRHW SGILRWFTSR
INNGVLEGIN SLIQAAKARA RGYRTTKNLI NMIYLISGKL NFGLPT
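
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; it is not the database's own pipeline. It assumes the standard amino acid assignments, which translation table 11 shares with table 1 (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). `translate` is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments of the standard code,
# which translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) in frame 0 until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAACGACC TTACCCTTTT TCAGATGGCC TTGGGACTTG AGTCCCCATG GTATGTATCG"
print(translate(first_line))  # MNDLTLFQMALGLESPWYVS
```

The printed residues match the first two blocks of the protein sequence; translating the final line of the gene sequence the same way gives `NFGLPT`, the last six residues, followed by the TGA stop codon.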