Gene Acid345_3963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3963 
Symbol 
ID4072435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4688422 
End bp4689903 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content52% 
IMG OID637985989 
Productrecombinase 
Protein accessionYP_593037 
Protein GI94970989 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.636106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.385317 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCG TTATTTACGC AAGAGTTTCG AGTCGCGAAC AGCAGCAAGA GGGGTTTTCG 
ATTCAGTCCC AACAACGAAG CTGCCGCGCC TACGCTACTA AGAACGGATT TAGGATCACA
CGCGAATTTA TCGACGTCGA AAGCGCAAAG CAATCGGGCC GGAAGCACTT CGGCGAGATG
GTCGATTTCC TTAAAAGAAC TAAGAACCCG GTTTGCCTAT TGGTAGAGAA GATGGACCGT
CTCTGTCGCA ACTTCGACGA CCTGGGCCTG CTTGAAAAAC TCGGCACGGA GATTCACTTC
GTAAAAACCG GCACCGTGCA TTCGAAAAAC GCGAAAGCGC AAACGAAATT CATGCACGGC
ATCGAGGTCG TTTCCGCGAA GTTTTATTCC GATAACTTGC GCGAAGAAGT CATTAAGGGA
ATGCGCGAAA AAGCCGAGCA GGGCATTTAT CCGGGACGTG CCCCGTTCGG TTATCGGAAC
AATCAAATAC TTAGAACCAT CGAAGTGCAT TCCGAAAATT CCGAAATCGT GAAATTCGCG
TTCGAGCAAT ACGCGACCGG CGGATTTTCT TTAATCACTT TAAGGAACGC GATTCGAGAA
CGATTCGGAA AAACCGTCAA CCGGAGTTAC CTGCACACCA TCCTGACGAA TCCGGTTTAC
ACCGGAGTTT TCGAGTGGGC CGGAAGGACC TACACCGGCA GTCATCCAAC ACTGATTTCA
TCGAATTTAT TCGAGAGCGT ACAAGCGGTG ATTCGTGGTT TTAACAAAGG CAAATATCGC
AAGGTCGATA TCGCGTTTCG TGGGATGCTC ACTTGCGCTC ACGACGACTG CACCGTGACG
GCAGAGCTCA AGAAAAACAA ATACGTCTAC TACCGGTGCA GTGGCGGTCG GGGCAAGTGC
GACCTTCCGC GTTTCCGGGA ACAGGAGATT TCCGACAAAC TCGGGACTCT GCTCAAAGAC
ATTTACGTCC CCGACGACGT TGTCGCGAAG ATCACCGCAG CGCTCGAGCA GGACGAACAG
AACTCCAAAG CCGAGATCGA ACGCCAGCGC CAGCGACTGG CAACGCAGAA AACACTCATT
CACGAACGCA TGGACAAAGC GTATGCCGAC AAACTCGACG GCAAGATTCC GGAAGAGTTC
TGGCAGCGCA AGATGGCCGA TTGGCAGACG GAAGAACGTC GAATCATCGA CGCGGAGGCC
GGGTTAACGA CACCAGCAGC CGAGCGCACC CTGAATGCGA AAAGGATTTT AGAACTCGCG
AATAAGGCGC ATTTTCTATA CGTTACGAGG AAACCGCACG AACAAGCCGA ATTGCTCAAA
AAGGTACTTT TGAACTGCTC GATAGACGGC GTAAGTCTTT ATCCAACTTA CAGAAAGCCC
TTCGATGTGA TCTTTGAAAG GGCGAAAAGT AATGAATGGT CGGGACGGGC AGATTTGAAC
TGCCGACCCC TCGCACCCCA AGCGAGTGCT CTACCAGGCT GA
 
Protein sequence
MDAVIYARVS SREQQQEGFS IQSQQRSCRA YATKNGFRIT REFIDVESAK QSGRKHFGEM 
VDFLKRTKNP VCLLVEKMDR LCRNFDDLGL LEKLGTEIHF VKTGTVHSKN AKAQTKFMHG
IEVVSAKFYS DNLREEVIKG MREKAEQGIY PGRAPFGYRN NQILRTIEVH SENSEIVKFA
FEQYATGGFS LITLRNAIRE RFGKTVNRSY LHTILTNPVY TGVFEWAGRT YTGSHPTLIS
SNLFESVQAV IRGFNKGKYR KVDIAFRGML TCAHDDCTVT AELKKNKYVY YRCSGGRGKC
DLPRFREQEI SDKLGTLLKD IYVPDDVVAK ITAALEQDEQ NSKAEIERQR QRLATQKTLI
HERMDKAYAD KLDGKIPEEF WQRKMADWQT EERRIIDAEA GLTTPAAERT LNAKRILELA
NKAHFLYVTR KPHEQAELLK KVLLNCSIDG VSLYPTYRKP FDVIFERAKS NEWSGRADLN
CRPLAPQASA LPG