Gene Acid345_3389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3389 
Symbol 
ID4072725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4010965 
End bp4013034 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content64% 
IMG OID637985411 
Productintegrase catalytic subunit 
Protein accessionYP_592464 
Protein GI94970416 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC TCTGGCTGAT CGAAGCTGAT GTGCTCGCCG CCATCTCGTT CAGTGAACGC 
CATCTCCGCC GTCTCGCCCG CGAAGGCAAG GTCATCAGCC GCGCGTCCGC TACTAAAGCC
GCCAATGGCC GCTTCGTTCG CGAGTACTGC ATCGAGAGTT TTCCCACCGA TCTTCGCGAG
AGGCTGCTCG CCGTGCGAGG TGCGATCACT CTCCTCCCAC AGAACAGCGC AACCGCCGCT
GAGCAGCCCC TCCTGAAGAT CGCCTGCGTT GACGAGGAAG AAGAAGCCCA GGCCGCGCGG
CGCGAAGCTG CCTGCCTCGC CATCGCCAAC TTCGAACAGC ACAAGGCGCA ATGGGCTACC
GTGCGCCTAG CGGATGGCGA GCCTGTCACT TCCATAAGCC GCCTGGTCGA GCATCTCGCG
GCGAAGTATT CATCCAGCCC CGCGACAATC TGGCGCTGGC ACAGCCGCTT CAAGAAGGGC
GAACGACTCG CCGACCGGAC CCGCGAAGAC AAAGGACGCT CGCGCTGGTT TGCTCGGCAC
GGCGACGCCG CAGCAGCCGC TGCCTATATC GGGTTGAAGT GGGGCGCGCG TGAAGCGCAT
CGCTCGCTGC TCTCTCACCA TGAGCTCCTC GGCATCGCGA AAGAGGAGAT GCCGTCTTAC
GAGACGGTGC GCTCGTTCCT GAATTCCGCG CCGCCGGCGA TGAGCATTCT GGTGCGCGAT
GGCGAGCGCC GCTACCGCGA CCTGATCTCG CCCTACGTCC GTCGCGGCTA CAACGAATAC
GCAAATCAAA TCTGGGTCAG CGATCACATG ATCCACGATC TCTTCGCGCA GAACGATGTC
TTTGACGATA TCCCGCGCGG CCAGCGTATT CGCATGCGCC TTACCGCGCT GCTCGATTTC
CGCGCCCGCT ACGTCGTTGG TTACAGCTTC GCCGAAGAAG GCAGCTCAAT CTCCATCACA
ACCTGCCTAC GCCAGGCGAT CGCCAGCTAT GGCGCATGCG AAGAGTTCTA TTGCGACAAC
GGCAAGGACT ACAAATCCGT TGCCAAGGCC GCGCTGCCCG CGTATCTGCG CGATTCCGGT
AAGGCCCCAC AGGACTGGTG GCAGCAGGAG CTCGACACGC AGGCCGGCGT GCTTGCGCGC
TGCGGGATCT CGATCCGCCA TTGCATCGTG CGGCATCCGC AGTCGAAACA CGTTGAACGC
TTCTTCCGCA CCGTGCACAA ACAGTTCGAC GCGCTCTTCC CAACTTATAG CGGCTCGAAT
CCAGATCGCC GTCCTGAGTT CACATCGAAG GCCATCGCCG AGCACAGCCG TCTGGAGCGT
GAGAGCGCGC GACTGATCCA GATGGGCAAA GGCATCAACG GTCTGCACCA ATCGCTCTTG
CCGCCAGCAA CGCTGGTGAT GAAGCTCTTC CGCGCCTGGC TTGACGAGTA CCACAACACG
CCGCATGGCG GTCAGGGTAT GGACGGTCGC ACACCGGCGC AGGTCTTCGA GCAGGAGCGC
AACCCGTTGC AACGCCCCGC GCCGGCCGAC AACGTTCTGG CCCTCATGTT GTGTTCGCGG
GAGCAGCGCA TGGTGCGCGA GTGCTCGGTC ACCGTGGGCA AACGGCGCTT CATCGGCGCG
GATTTCACCG CGGTGAAGCG ACTGCACGAC GTGAGCAACT GCGAAGTGAT GGTGGCCTAC
GACCCGCTCG ATCTCGATCG CGTGGCGGTC CTCGATCTCG ACGGCAACCT CATTTGCTGG
GCCAAGCCCG AAGAGTTCCT CCCCCAGAAC ACCACCCAGG CGGCGAACGC GATTGCGGAA
AGCATGCAGC AGCGCCGCCG CCTGGAGCGC AACACACGCG ATGCCTACGT GGCCATGCGC
GATGCCGCCC GCGCCTCCGG TGTCGTCACC GGCGTGGAGC GGTTGGTGAA CAAGGTGCTC
GCGCTGCCAC CCGCCAGCGA GGTGCCGGCA GTGCAACGCA GTTCAATGGC GCGCGCGGCC
AAGGCAGCAG CAGCGGCCGC ACCCAAGGTG AGCCAGAAGT ATGTAGGCGA CGTAGCCGAA
GAGATCGCCG GATTGATGGA GGGGGACTGA
 
Protein sequence
MSDLWLIEAD VLAAISFSER HLRRLAREGK VISRASATKA ANGRFVREYC IESFPTDLRE 
RLLAVRGAIT LLPQNSATAA EQPLLKIACV DEEEEAQAAR REAACLAIAN FEQHKAQWAT
VRLADGEPVT SISRLVEHLA AKYSSSPATI WRWHSRFKKG ERLADRTRED KGRSRWFARH
GDAAAAAAYI GLKWGAREAH RSLLSHHELL GIAKEEMPSY ETVRSFLNSA PPAMSILVRD
GERRYRDLIS PYVRRGYNEY ANQIWVSDHM IHDLFAQNDV FDDIPRGQRI RMRLTALLDF
RARYVVGYSF AEEGSSISIT TCLRQAIASY GACEEFYCDN GKDYKSVAKA ALPAYLRDSG
KAPQDWWQQE LDTQAGVLAR CGISIRHCIV RHPQSKHVER FFRTVHKQFD ALFPTYSGSN
PDRRPEFTSK AIAEHSRLER ESARLIQMGK GINGLHQSLL PPATLVMKLF RAWLDEYHNT
PHGGQGMDGR TPAQVFEQER NPLQRPAPAD NVLALMLCSR EQRMVRECSV TVGKRRFIGA
DFTAVKRLHD VSNCEVMVAY DPLDLDRVAV LDLDGNLICW AKPEEFLPQN TTQAANAIAE
SMQQRRRLER NTRDAYVAMR DAARASGVVT GVERLVNKVL ALPPASEVPA VQRSSMARAA
KAAAAAAPKV SQKYVGDVAE EIAGLMEGD