Gene Acid345_3762 details


Gene Information

Locus tag: Acid345_3762
Symbol:
ID: 4069337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4446015
End bp: 4447154
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 60%
IMG OID: 637985784
Product: phage integrase
Protein accession: YP_592836
Protein GI: 94970788
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
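
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 4446015
end_bp = 4447154
gene_length_bp = 1140
protein_length_aa = 379


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_len):
    """Residue count for a complete CDS: codons minus the stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under these assumptions both checks pass: 4447154 - 4446015 + 1 = 1140 bp, and 1140 / 3 - 1 = 379 aa.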


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.288729
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATAT CTCCCGTTGT CACGATCTTT GTCAGGCACT CCGCAGATTG CAAATATCAC 
GGGGAAGAGT TTGAGAAGCG CTGCCGTTGC CGCAAGCACC TCCGATGGAG CCAGAACGGA
AAGCAGTACC GACGCAAGGC CGGGACGCGC TCGTGGGGAG AGGCAGAGAA CGTAAAGCGG
GAGTTGGAGG CCCAACTCTC CGGGCGAGTC ACCGAGACGC CCGCCGCGCC CGAGCAACGG
TTACTACCAG AGGCCACGGA ACTTTTTCTC AAGGACAAGA AAGTTCAAGG CGTGTCCAAG
GGTGTGCTCG GCAAATACAC CCGAGAACTG GATCGGCTCC GGACCCACTG CGAACGAGCG
GGCGTCTACA CTGCGCAAGG AATCACCCGC GAGTTGCTCA CCGAGTTTGC AGCGACGTGG
GAGAGTGTGT ACCCGAGCAG TTCCACCCGT TCCAAGGTTC GCGAGCGCTG CCGCGCCTTC
CTCCGTTACT GCTACGAGTG TCAGTGGATT CCGCGTATCC CGGCACTGCC CAAGATTCAA
GTAGATGAGC CGGAGACGAT GCCCCTCACG GATGCGGAAT TTAAGCGGCT GCTTGACGCG
ACGTATGCGG AAGTTTCGGA CACGGACCAG CGGGCAAGGG TTCACGCGCT CTTTCAACTC
ATGCGCTGGA GCGGTCTAGC GATTGGAGAC GCACTCCGGC TGGAGCGCTC GAGAGTCATC
CACGACGAAG GGAAGGGTGT GCACCGCGTT GTCACCGCTC GGCAGAAAAC CGGAACGCCC
GTGTCCGTGC CGATCCCGCC CGACGTTGCG GAGGAAGTGC TCAAGGTGCT GAACGGAAAT
CCTCGCTACG TGTTTTGGAG TGGAAAGGGC GAGCCCGAGA GCATCTCGAA AAATTGGTCC
AAGTACTACG TTCGCCCGTG CTTCGAGGGA GCGAAGATCG AGAGCAACGG GAACATGATG
TCCCATCGCC TCCGAGACAC ATTCGCGTGT GACCTCTTGC AGAAGGGTGT GCCGTTGGAG
GAAGTGTCCA AGCTGCTCGG GCACGAGAGC ATCAAGACCA CAGAGAGAAG CTATGCGAAA
TGGATTCAGG CGCGTCAGGA CCGGCTCGAC ACGCTCGTGA TGACGACGTG GGCCAAGTAG
 
Protein sequence
MTISPVVTIF VRHSADCKYH GEEFEKRCRC RKHLRWSQNG KQYRRKAGTR SWGEAENVKR 
ELEAQLSGRV TETPAAPEQR LLPEATELFL KDKKVQGVSK GVLGKYTREL DRLRTHCERA
GVYTAQGITR ELLTEFAATW ESVYPSSSTR SKVRERCRAF LRYCYECQWI PRIPALPKIQ
VDEPETMPLT DAEFKRLLDA TYAEVSDTDQ RARVHALFQL MRWSGLAIGD ALRLERSRVI
HDEGKGVHRV VTARQKTGTP VSVPIPPDVA EEVLKVLNGN PRYVFWSGKG EPESISKNWS
KYYVRPCFEG AKIESNGNMM SHRLRDTFAC DLLQKGVPLE EVSKLLGHES IKTTERSYAK
WIQARQDRLD TLVMTTWAK
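
The gene and protein sequences above can be compared by translating the DNA. The sketch below is illustrative only: it uses a hand-built codon table (translation table 11 shares its codon-to-amino-acid assignments with the standard code), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 bases and the last 9 bases of the gene sequence are embedded here; the full sequence can be pasted in the same way, spaces and line breaks included.

```python
# Codon table built in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_block = "ATGACTATAT CTCCCGTTGT CACGATCTTT"  # first 30 bases of the gene
last_codons = "GCCAAGTAG"                          # last 9 bases of the gene

print(translate(first_block))  # MTISPVVTIF
print(translate(last_codons))  # AK*
```

The two outputs match the start (MTISPVVTIF) and end (...WAK, followed by the stop codon) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.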