Gene Cpha266_1429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1429 
Symbol 
ID4568990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1623620 
End bp1625194 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content50% 
IMG OID639766015 
Productintegrase catalytic subunit 
Protein accessionYP_911881 
Protein GI119357237 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.398273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAACA AGGCCTTGAC TATGTTACAA GTTCGACGTA TTCTCAAACT CTTGATGGAG 
GAGTGTTCCC AACGGGAAAT CCATCGCAGT ACAGGTATTC ACCGCGTCAC CATCAAAAGC
TATCTGCACC GGTTTACGAG CAGCGGAAAA CCGTTTTCAG AGCTGTATGC GCTCTCTGAT
TACGATCTTT CTGTTCTGGT TCACCCACCC CGTTCCACCA AAACCTCTGA TGAACGGTAT
GCAGATCTCC AGCCCCAACT GCAACGTTTT TCTGATGAGC TGAACAAGAC GAACTCTCAT
GTTACCAAGC AGGTGTTATG GGAAGAGTAT CTTCAGGATC GACCTACCGG GTATCAATAT
TCCCAGTTTT GCTATCACGT GGATCAGTAC ATAAAACAGC ATGCCGTCAC GATGCCGCAG
CAGCATGAGC CGGGCTACCG ACTGCAGATC GACTTTGCTG GTGATCCGCT CTGGATTATC
GACCCGCTTA CCAGAGAACG CATCAAGTGC CCGGTTCTGG TCTGCACGTT GCCTTGCAGC
AGCTTTTTTT ACGTTGAACC GCTCTCATCT TGCAGGCAGG AGCACCTGAT TCCTGCACTC
AATCGGGCGC TTGCCTATTT TGGCGGTGTT CCCAAAAACA TTCTGAGCGA CAACATGAAA
CAGGTCGTGA CAACAGCATC ACGCTATGAG CCTGTTTTCA ATGATCTTAT GGAACAATGG
GCCTTGCACT ATCAGACCAA CATGCAGGCA ACCAGAGCCG TCAGGCCCAA GGATAAGCCA
TCTGTTGAAG GCTCGGTGCA CCATGCTTAT CAGCAGATTT ACGCAAGGTT GCGCAATGAG
GAGTTCACCA GTCTGAGTGC GTTGACGTAT CGGGTTCGGC ATCTGCTTGA TACGGCCAAT
GATCGGCTGA TGACCGATTA TGGCAAGAGT CGCAGACAGC GGTTTATAGA ACTTGAGCAA
GAGTTTTTAC AGCCACTGCC GCTGACTGAT TTTGTGTACA AGCGTGAAAC AACTGCCAAA
GTCAAGAAAA ATTATCATGT CATTCTGGGC GAAGACCGCT GCCAGTACAG TGTTCCGCAT
GAGCATATCG GCAAAATCGT CAAGCTGATC TATGATGAAT CGGTGGTTGA GGTATTTCTT
GATTTCCAGC GTATCGCCTT GCATCAGCGC ATCGTCGGAC GCCGGGGCAT CTACAGAACT
GTCGAGGAAC ATATGCCGGA ATCACATCGC CGGTACCATC AGCAACAAGG GTGGACTGAG
GAGGACTTTA CCAGCAAAGC TGCCGCTGTC GGGCCCTGTA CCGAGGAAGC TGTTTTGCGG
CTTCTGAGTT CAAAAGCTTT TGCACAACAG AGCTTTGATG CCTGCCTGGG CATTCTCCGG
CTCCAGAAAA AGTATGGAAC AACAAGACTC GAAGCGGCTT GCAGTGTAGC CCTGCAAGTC
CCACGCCTCA ACTATCGACT CGTCAACAAC ATTCTGGAAA ACAACAGGGA CAAGGTCTCT
GTTGCAGCAG GAGAACAGCG TGCATCACTG CTTCCGTTGC ATGACAATAT TCGCGGTAAA
GAAGTCTACA ATTAA
 
Protein sequence
MANKALTMLQ VRRILKLLME ECSQREIHRS TGIHRVTIKS YLHRFTSSGK PFSELYALSD 
YDLSVLVHPP RSTKTSDERY ADLQPQLQRF SDELNKTNSH VTKQVLWEEY LQDRPTGYQY
SQFCYHVDQY IKQHAVTMPQ QHEPGYRLQI DFAGDPLWII DPLTRERIKC PVLVCTLPCS
SFFYVEPLSS CRQEHLIPAL NRALAYFGGV PKNILSDNMK QVVTTASRYE PVFNDLMEQW
ALHYQTNMQA TRAVRPKDKP SVEGSVHHAY QQIYARLRNE EFTSLSALTY RVRHLLDTAN
DRLMTDYGKS RRQRFIELEQ EFLQPLPLTD FVYKRETTAK VKKNYHVILG EDRCQYSVPH
EHIGKIVKLI YDESVVEVFL DFQRIALHQR IVGRRGIYRT VEEHMPESHR RYHQQQGWTE
EDFTSKAAAV GPCTEEAVLR LLSSKAFAQQ SFDACLGILR LQKKYGTTRL EAACSVALQV
PRLNYRLVNN ILENNRDKVS VAAGEQRASL LPLHDNIRGK EVYN