Gene Acid345_4275 details


Gene Information

Locus tag: Acid345_4275
Symbol:
ID: 4071847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5078992
End bp: 5080173
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 57%
IMG OID: 637986307
Product: phage integrase
Protein accession: YP_593349
Protein GI: 94971301
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
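
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5078992
END_BP = 5080173
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1182
    print(expected_protein_length(GENE_LENGTH_BP))  # 393
```

Under these assumptions both derived values agree with the listed gene length (1182 bp) and protein length (393 aa).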


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0146194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGAGGA ACGGAAGGCA TCAAGAAGGT CAAGTTTTTC GCAAAGGAAG TGGTTGGTAT 
CTTCGCTATC GCGAATCGGA ACACCAGCCG GACGGTTCAG TAAAACGGGT GCAGAAGTGC
AAAAAGTTGG CGGACTACGG TGGCGCTCTT CGAACGAAGA GCTCCGTCCG GGTTCTGGCT
GACGAGTTTC TTTCACCCCT CAACAACGGA ACAATCACGG TAGGCAGCAG CATGAGCCTG
ACAGACTTCA TCGAAAAACG GTATCTGCCG TACATCAAGG AGCACAAAGC TCCCAGCACG
TACGCCGGCT ACAAGAATCT TTGGAGTCTG TACATCAAGG AGCGCGGGAC GTCAGCGCTC
CGCGATTACC GCACCTGCGA GTGCGAGGAC ATGCTCCTGG AAATCGCTCG GGCCCATGAC
ACCGCGAAGG AAACGATCAA GCGCGTGAAG TCGTTCTTAT CGGGAACGTT CCGCTATGCC
AAGCGCCAAG GCGTCTTGCA TACGGAAAAC CCGATGTGGG ACACCGTGAT CCCAGAATGT
CGCGAAGGTG AGGAGACATA CGCATACTCG CTTCACGAAA TCCTGCGGAT GCTGGAATTG
GTTCCGGAAC CTGCGGCCTC GATGATCGCG GTCGCAGGAT TTGGCGGCTT GCGGAGCGGA
GAGATCCGCG GATTGCTGGT GGAGCACTAC AACCATGACT CGATCTTCGT CGCACAGTCA
GCATGGCGGT CCCAGGTAAA GAAGGTGAAG ACCAAGGCGA GCAAGGCGCC GGTCCCAGTC
GTCTCACAGT TAGCGGCGCG GATTGATGCA CATCTCAAGA CAATGGGTTC GCCGGCCAGC
GGCTTCATGT TTCCGAACGC CGTCGGCAAG CCGATCGGGA TGCAACGTGT GGCGGATGAA
GTCATTCGCC CAGCGCTCAA AGGCTCTGGC ATCGAGTGGC ACGGCTGGCA TGCACTGCGT
CGCGGATTGG CAACGAATCT CCACGGCTTG GAAGTGCCGG ACAAGATCAC GCAAATGATC
CTCCGTCACT CGAGTGTTTC CGTGACGCAG AGCTGCTACA TCAAAACCGT TGATTCGCAG
GCGGTTAAGG CGATGCGGAA ATTGGAGTGT GCAACTACTG TGCAACTGGC GAAGGCACAG
CGGGAAGCAA CTCCCGAGGT TTCGTCACCG AGGATTATGT AA
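
A GC content figure such as the one reported for this gene is the fraction of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as plain text with the space-separated blocks of ten shown above; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters in seq.

    Whitespace and any other characters are ignored, so the sequence can
    be pasted with its block spacing and line breaks intact.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence, copied from the record above.
FIRST_LINE = "GTGAAGAGGA ACGGAAGGCA TCAAGAAGGT CAAGTTTTTC GCAAAGGAAG TGGTTGGTAT"

if __name__ == "__main__":
    print(f"{gc_fraction(FIRST_LINE):.0%}")
```

The first line is only a 60-base fragment, so its value need not match the whole-gene figure; passing the full 1182-base sequence is what would be comparable to the reported GC content.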
 
Protein sequence
MKRNGRHQEG QVFRKGSGWY LRYRESEHQP DGSVKRVQKC KKLADYGGAL RTKSSVRVLA 
DEFLSPLNNG TITVGSSMSL TDFIEKRYLP YIKEHKAPST YAGYKNLWSL YIKERGTSAL
RDYRTCECED MLLEIARAHD TAKETIKRVK SFLSGTFRYA KRQGVLHTEN PMWDTVIPEC
REGEETYAYS LHEILRMLEL VPEPAASMIA VAGFGGLRSG EIRGLLVEHY NHDSIFVAQS
AWRSQVKKVK TKASKAPVPV VSQLAARIDA HLKTMGSPAS GFMFPNAVGK PIGMQRVADE
VIRPALKGSG IEWHGWHALR RGLATNLHGL EVPDKITQMI LRHSSVSVTQ SCYIKTVDSQ
AVKAMRKLEC ATTVQLAKAQ REATPEVSSP RIM
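
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The record lists translation table 11, which uses the same codon-to-amino-acid assignments as the standard code but allows alternative initiation codons such as GTG; an initiation codon is read as methionine. The code below is a minimal illustration, not the pipeline that produced this record; `translate_cds` and the constant names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. If the first codon is a permitted start codon
    it is translated as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
FIRST_LINE = "GTGAAGAGGA ACGGAAGGCA TCAAGAAGGT CAAGTTTTTC GCAAAGGAAG TGGTTGGTAT"
LAST_LINE = "CGGGAAGCAA CTCCCGAGGT TTCGTCACCG AGGATTATGT AA"

if __name__ == "__main__":
    print(translate_cds(FIRST_LINE))  # MKRNGRHQEGQVFRKGSGWY
    print(translate_cds(LAST_LINE))   # REATPEVSSPRIM
```

The first line begins with GTG, which is read as M here, and the output matches the first twenty residues of the protein sequence. The last line is in frame because the preceding 1140 bases are a multiple of three; it ends in the stop codon TAA, and its output matches the final residues of the protein.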