Gene Acid345_2069 details

Gene Information

Locus tag: Acid345_2069
Symbol:
ID: 4069919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2479770
End bp: 2480849
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 60%
IMG OID: 637984083
Product: phage integrase
Protein accession: YP_591144
Protein GI: 94969096
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
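
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2479770, 2480849)
print(length)                  # 1080
print(protein_length(length))  # 359
```

Both results agree with the Gene Length and Protein Length fields listed above.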


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.244007
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000737798
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCTAGAA GAAAAACGCC TGACCGCGAC GGCGTGTACC AGCGCAAGGA CCGTCCGGGC 
CAGTTTTGGG CAAGCTGGAC TGATCAAAAT GGCAAGCGCC GGCAGCGGCG ACTACGCGGC
GTCTTCACTC TGACGCAGGC GAAAGAGTTG CTCGCTGCTG CGCAGCTGCG GGTGGAGCGG
ATCAAGGTGC TTGGCTTCGC GGAACCCACA AAGGAACGTT TCGAAAAAAT CGAAGCTCGC
TTTCTGCTGC ACCAACGCGC GCGCCTGACG AAAGCAGGCT ACGAACGTGA GCGTGGCGTC
GTCGAAACAC ATCTGCACGA TTACTTCGGG CAGATGCAAC TGGCCCTTAT CCGACGTGGC
GACATCGCAA ATTTCGTCAC GAAGCGGGCG GCCGAAGTCT CGCCGGCTAC GGTCGCGAAA
GAGCTTGTGA CGCTCAAGCA TCTCTTCCGT CTTTGCGTCG AATGGGAGTT GCTGGTGCTA
AATCCCGCAA CCGGTGTCAA GGCCCCGAAG GTTGCGCCTG GCCGTGTGCG CTGGCTGCAA
CCCGAAGAGC TGCGCGCCCT GTTGCAAGCC TGCCCTGATT GGTTGCGCCC GATCGCAGGT
CTCGCCGCGG CTACAGGGAT GCGGCGAAGC GAGGTGCTCG GCGTGCGATG GTTCGACGTG
GACCGCAGGA ACGGGTGTCT CACGCTGCGT CAAACCAAGA ACAACGAAAG CCGCATCGTG
TTTCTCAATC AATCTGCGCT GCACGTCATC GACTCGCTTC GAAAGGGGAA GTCAAGCGAG
AAGCTCTTCC CGAAGGTCAC GCCGGAACAA GTGAGCATGG GATTCCTTCG GACCTGCCGT
GAGCTTGGTA TCGAGGATTT TCGATGGCAC GATTTGCGGC ACTGCTTCGC CTCACTACTG
CGTCAGTCGG GCGCCGACTT GCAGGATGTA GCCGAGTTGC TTGGTCATAA GGATCTCCGA
ATGACGAAGC GATACTCCCA TCTCTCGCCA GCCCACCTAT CAGCGGCGGC CAGGCGATTC
GATGCCGTGC TCGGGGAGCC CTTGTCGCTT GCTGACCCGA GCGGAGGGCG GGGAGCATAG
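
The GC content field in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a plain string in the space-separated block layout shown here. The helper name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces, newlines and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_fraction("ATGCCTAGAA GAAAAACGCC"))  # 0.45
```

Applied to the full gene sequence, the result can be rounded to a percentage and compared with the GC content field.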
 
Protein sequence
MPRRKTPDRD GVYQRKDRPG QFWASWTDQN GKRRQRRLRG VFTLTQAKEL LAAAQLRVER 
IKVLGFAEPT KERFEKIEAR FLLHQRARLT KAGYERERGV VETHLHDYFG QMQLALIRRG
DIANFVTKRA AEVSPATVAK ELVTLKHLFR LCVEWELLVL NPATGVKAPK VAPGRVRWLQ
PEELRALLQA CPDWLRPIAG LAAATGMRRS EVLGVRWFDV DRRNGCLTLR QTKNNESRIV
FLNQSALHVI DSLRKGKSSE KLFPKVTPEQ VSMGFLRTCR ELGIEDFRWH DLRHCFASLL
RQSGADLQDV AELLGHKDLR MTKRYSHLSP AHLSAAARRF DAVLGEPLSL ADPSGGRGA
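
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard amino acid assignments, which translation table 11 shares with the standard genetic code (the two tables differ in the set of permitted start codons, which is not relevant here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first; ambiguous bases raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCTAGAA GAAAAACGCC TGACCGCGAC"))  # MPRRKTPDRD
```

The printed residues match the first ten residues of the protein sequence listed above.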