Gene Acid345_3161 details


Gene Information

Locus tag: Acid345_3161
Symbol:
ID: 4071231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3755008
End bp: 3757494
Gene Length: 2487 bp
Protein Length: 828 aa
Translation table: 11
GC content: 60%
IMG OID: 637985181
Product: hypothetical protein
Protein accession: YP_592236
Protein GI: 94970188
COG category:
COG ID:
TIGRFAM ID: [TIGR01541] phage tail tape measure protein, lambda family
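
The start, end, and length fields above agree with one another, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `cds_lengths` is illustrative and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(3755008, 3757494))  # (2487, 828)
```

The result matches the Gene Length (2487 bp) and Protein Length (828 aa) fields.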


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAACA ACGTCAAGGT AATCATCCAG GGCGTAAACC AGGCTGGCGC TGCGATCGCG 
GATGTAAAGG AAGGGCTGGC GTCGATTGAG GACGCGACCT TGGGAGTTGG AGAAGCTTTT
GGCGCGATGC AGATCGCGGA ATGGGCGAAA GAATTTGGTG ATTTTGTTGC CGAGACCGCC
GATTCGGTAG CCGCGCTAGG TCGCCTCTCG CAGCAGAGTG GTACCGCGGC TTCGGAATTT
ATGGCACTGA AAGGCGCGGC CGAGCAGAGC GATATTCCCA CCGAGACGCT TGCAATCTCG
CTGAAGAAGT TATCAACGAA CATGGCGGAA GCCGGTGCAG GGAACGCAAA GGCATTACAG
CTCTTTAAGG ACCTTGGCGT CAGTGCGACC GATGCCAGTG GGAAGCTCCG CCCGGTGACC
GACGTCCTTC TCGATATCTC TGAGCGTTTC AAGGATTATG CGGACGGTGC GGGGAAATCG
GCTCTTGCTG TGCAGGCATT TGGTCGCGGC GGCGATGCGA TGATCTCCTT TTTGGATAAG
GGCAAACAGG CGATTTCGGA GGCCGCGGCC GAAGCGACGA AGTTCGGGGA CGTTCTTGGC
GGTGACGCTC TAGATGCTGT GATTCGGTTT CACGATGAGA CCGTAAAGCT GCAAATGGAG
GCCGAGGGTT TCAAAGTTCA ATTCGTCGCT GCGCTCACTC AGGAGCTCGG GCCTCTCGCT
GACGCTTTCA AATATGCCAC TGGCAACACG GATTCCTTCT CTGCGTCGTT CGGTAAGTTC
CTTGGGCATG AGGGTGGAGC GTTTTTCAAA GATACGATCC GCGATGTGGC CGGATACGGT
CTCGCCATCG CCGAAGCAAG CCTTGAAATT TCGAAGATTA CGGCCGAGTT TTTTGGAGCC
GAAGCGGCCG CACAACAATT CGATCAAGGC CTCGCGGCGC TCAAAACAGA AACGTCCGAT
TTTTTCAAAC TCCTCGATGC GGCGCCGAAG GGTTTGCTAT CGGGCTTTCT TGGAGGCCTC
GGTTCGGATG TCGATGCCGC AATAGCAAAC CTGGATAAGG CTTTCCCCAC AAAAAAGCCA
AGTCTTACGG TTACTGCGCC GGATAACACC GAAAAGCTGC TCAAAGCCCA GCAGGAATTG
CGCAAGGCGT ATGCCGCGCA GGATGTTGCG ATCGCGCAGG GCGAAGCCAA GGACCAACTC
GCGGCGCTCG AGCTCGAACG CGAGCAGGGG CTCGTCACGC TCGGCGATTA CTACAAGCAG
CGTTACGACA TCACGATCGG CGCCGTCGAT AACGAGATCT ATGCGCTCTC GCAGGAACTG
GCCGCGCAGC AGAAGGTTGT GGACGCCGCG AAGAAGGGCT CGCCCGAGCA GGTCCAGGCC
GAAGCGCAGC TGGTGCAGAT CCAGAACCAG GTCATCGCCA AGATGAACGA GCGCGGCGCG
CTGGTCAGCT CGCAGGCGAC GTCGCAGATC AAGGCCGAAC GCGACTTCGG TTACCAGGTG
CTGGCCATCC AGGCGCAGAT TGACGAGGCG AAGGACCACT CCGCGGAAGT TGCGATCGCG
GCGATCAACA AAGAGTACGA CGAAAAACGG CGCATCCTGC AGGCGGCCGG CAAAGACACT
ACCGAGCTCG ACCAGGCTAA GCAGATGGCG ATCGCCGCGG CGAAGGCCGA GAACCTCGCC
AAGCAGATTG ACACCGTGTA TGCCGACCTC GAAGAGAAAG TCGCGGCTGT AAACGAGGCG
ATTTCAAACG GCACAATCAA CCAGATCGAC GGGCAGACGA AGGTCCAGGA GCTCAACGGG
CGCGCCGCCG GTCAGCTCAC CGGACTGATC CAGCAATACC AGGCGCTCGC TGAAGCATCC
GGCAATCCCG CGCTGATTAC GAACGCCGAT AAGTTCCAGA AGAAGCTCGA CGATCTCGGT
AAGCACGCCT CGCTGATGGG CGACCTGGCG CGACAGGCCT TCGAGCGCGG CTTCGAGAGC
TTCCTCAGCG ACGTCGAGAA CGGCAAGTGG ATCAACGCCC TCGAGGACTT CGGCAAGGCT
TTCCTCGACG TGATGAACCA GATCCTGGCC AAGATGCTGG CCGTGTACGT GATGCAGAAG
GTGCTCGGCT GGATCGGCGG CGCCGTGGGT GGAGTTGCTT CCACGCCGGG CGGCAGTCCC
ACCGGCGGTG GCGATGTTGG CTCGCTGGGG GACGTGGGCA TTACGCCGTT CTTCGCCTCG
GGTGGCCACA TGGATGCCGG CGACATGGGT GTAGTCGGCG ACCAGGGCCC GGAGCTGTGG
GTGCCCGACG TCGGCGGCCA TGTCGTGCCG ATGTCCGGCA TGAGTGACAG TTCCGCGCTC
ACGGTGCACA ACCACATTGC TGTAGACGCA CGCGGCGCGC AGGTCGGTTT TGCGGAGCAG
CTTGCCATCC AGCTCCGCGC CACGGAAGAG AAAGCAGTAA AGCGGGCGGT ATTTGAGACC
GAGGAGCGGA GGAGACGCAG AGCATGA
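
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be recomputed. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact rounding used for the GC content field is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence: six blocks of ten bases.
first_line = "ATGGATAACA ACGTCAAGGT AATCATCCAG GGCGTAAACC AGGCTGGCGC TGCGATCGCG"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.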
 
Protein sequence
MDNNVKVIIQ GVNQAGAAIA DVKEGLASIE DATLGVGEAF GAMQIAEWAK EFGDFVAETA 
DSVAALGRLS QQSGTAASEF MALKGAAEQS DIPTETLAIS LKKLSTNMAE AGAGNAKALQ
LFKDLGVSAT DASGKLRPVT DVLLDISERF KDYADGAGKS ALAVQAFGRG GDAMISFLDK
GKQAISEAAA EATKFGDVLG GDALDAVIRF HDETVKLQME AEGFKVQFVA ALTQELGPLA
DAFKYATGNT DSFSASFGKF LGHEGGAFFK DTIRDVAGYG LAIAEASLEI SKITAEFFGA
EAAAQQFDQG LAALKTETSD FFKLLDAAPK GLLSGFLGGL GSDVDAAIAN LDKAFPTKKP
SLTVTAPDNT EKLLKAQQEL RKAYAAQDVA IAQGEAKDQL AALELEREQG LVTLGDYYKQ
RYDITIGAVD NEIYALSQEL AAQQKVVDAA KKGSPEQVQA EAQLVQIQNQ VIAKMNERGA
LVSSQATSQI KAERDFGYQV LAIQAQIDEA KDHSAEVAIA AINKEYDEKR RILQAAGKDT
TELDQAKQMA IAAAKAENLA KQIDTVYADL EEKVAAVNEA ISNGTINQID GQTKVQELNG
RAAGQLTGLI QQYQALAEAS GNPALITNAD KFQKKLDDLG KHASLMGDLA RQAFERGFES
FLSDVENGKW INALEDFGKA FLDVMNQILA KMLAVYVMQK VLGWIGGAVG GVASTPGGSP
TGGGDVGSLG DVGITPFFAS GGHMDAGDMG VVGDQGPELW VPDVGGHVVP MSGMSDSSAL
TVHNHIAVDA RGAQVGFAEQ LAIQLRATEE KAVKRAVFET EERRRRRA
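
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the sequence is given 5' to 3' in the coding frame. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares; table 11 differs in which start codons it permits, and this sketch does not model that (an alternative start codon would not be rendered as M). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a codon with non-ACGT characters raises KeyError.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGGATAACA ACGTCAAGGT AATCATCCAG"))  # MDNNVKVIIQ
```

The output matches the first ten residues of the protein sequence listed in this record.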