Gene Acid345_3909 details


Gene Information

Locus tag: Acid345_3909
Symbol:
ID: 4072246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4624580
End bp: 4628041
Gene Length: 3462 bp
Protein Length: 1153 aa
Translation table: 11
GC content: 59%
IMG OID: 637985935
Product: TonB-dependent receptor
Protein accession: YP_592983
Protein GI: 94970935
COG category:
COG ID:
TIGRFAM ID:
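
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
start_bp = 4624580
end_bp = 4628041
gene_length_bp = 3462
protein_length_aa = 1153


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, ASSUMING one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks hold for this record: 4628041 - 4624580 + 1 = 3462,
# and 3462 / 3 - 1 = 1153.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```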


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000652763
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00581227
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGTCTT TCCGCCACGT AGCGATAAGC GTTTTCCTCT TTGTTGTTCT CACAGGCATC 
GGCCTGGCGC AGAACACCAG CCAGATCACC GGTTCGGTCC GCGACTCAAG TGGCGCGGCC
GTACCCAATG CCGAAGTGGT CGTGAGCAGT CCGGAGCGCG GTATCGAGCG CCCTACAAAG
ACCAACGACG CCGGTGAATA TGCAGTCAGC GGCATTCCCG CCGGTTCCTA CAACTTGAAA
GTTACGGCGC AGGGCTTCAA GTCCTACGAA GCGAAAGGCA TCGTGCTTCG CGTGGCGCAG
AAGACACGCG CTGACGCCGA CCTTCAGATC GGCGGCACGA CGACCGAAGT CACCGTAGCC
GGCGAGAGCA TCGGCCAGGT GGAAACGCAA TCGTCGGATA TGTCGGGCGT CGTCACCGGC
AAAGAGATTT CGCAACTTCA GTTGAATGGA CGCAACTTCA CGCAGCTCGT GACCCTCGTC
CCCGGCGTCA GCAATCAGAC CGGACAGGAT GAAGGCACCG TGGGCATCGC TGGGAACGTC
TCTTTCAGCT TCAATGGCGG CCGCACTGAG TACAACAATT GGGAGCTCGA CGGTGGCGAC
AATATGGACA ACGGGTCCAA CGCCACCCTT AACGTTAACC CAAGCTTGGA CTCCATCGCC
GAAGTCAAAG TGCTCACCTC GAACTACGGC GCGCAGTATG GCCGCAGCGG TTCTGGCACG
GTCGAAGTCG AGACCAAGTC TGGTACCAGC AGCTTCCACG GCGACGCATA CGAATTCGTT
CGCAACGACG CGTTCAACGC GAAGAGCTAT TTCTCTTACA CCGAGCCGCT GATCCCGGCT
TATAAGAAAA ACGACTACGG CTACACTCTC GGCGGACCGA TCTTCATTCC CGGGCACTAC
AACGAGAGCA AGCAGAAGTC TTTCTTCTTC TGGTCGCAGG AGTGGCGCAA AGAACGCGTT
CCGGCGCCGT TCAACATTCC GGTGCCGTCA GCCGCGGAGC GCGCCGGAGA CTTCAGTGAC
CAGTGCCCCG GCAACTCCTG TCCGCATATG GCTGACGGGA GCCCGTATCC CGGCAACATC
GTTCCGATTG ATCCGACGGG AAGTGCGCTT CTGGCGCTGA TACCGGGGGC GAATCTCGGC
TCAGGAGCGA GTTCGGTGTA CAACGCTTCG CCGACACAGC CGACCTACTG GCGCGAAGAA
CTCTTTCGCA TCGATCACAA TATCAACGAC AAGTGGCACG TGACGTTCCG CTACACCCAC
GATAGCTGGA ACACAATCAA CCCAACGTCA CAATGGACCG GTAGCGCTTT CCCAACGGTG
CAGACGAATT TCGTCGGCCC GGCAATCAGC ATGGTGGGGC GTGTCACAAC GACCTTCACG
CCGACGCTGG TCAACGAGTT CGTGATGAGC TACACCACCG ACCACATCAC GTTCTCTTCG
ACCGGAACCC CGAATCCGAA TGCCTGGCAG CGACCGCAGG ATCTCGCCAT GGGCTATCTC
TTTAACAACG GTTTTGGTGG AAAGCTTCCG GCGATCACCG TCTCCGATCC TGCTTACGGC
GGAGGCTTCT ACGAGGACCC GAACGGCGAA TGGCCGGAAG GCGCGTACAA CTCGAACCCG
ACTTACACCT TCCGCGACAA CTTGAACAAG ATCATCGGAA GACATAACCT GCAGTTCGGT
GCGTACTACG TTGCGGCACA GAAGAACGAA CTCAGCGGCA TCCTGGTCAA TGGATCGCTC
GGCTTCGACA GCACGTCGGC GGTTTCAACC GGGAATGCCT TTGCAGACAT GCTGACCGGA
AATATCGCGA GCTTCTCGCA GGGCAGCGAC AACATCAAGT TCTACAACCG CTACAAGATC
CTCGAACCCT ACTTCCAGGA CGACTGGCGC GTCACGCCGA AACTCACGTT GAACCTCGGA
ATCCGTCTCA GCGCGTTCGG GACTTACCGC GAGAAGGACA ATCACGCCTA TAACTGGGAC
CCGAAAGCCT ACGATCCAAC CTCCGCTCCG GTGTTCAATG CTGATGGTTC AGTGAGCGGC
GGCAACATTT ACGACGGGCT TGTGCAGTGC GGCAAGAGCA GCGTGCCGGA AGGCTGTATG
TCCGGCCATC TGTGGAACTG GGCTCCGCGA GTGGGCTTCG CTTGGGATCC GTTCGGCACC
GGCAAAACTG CTGTTCGCGG CGGCTACGGG ATCTTCTACG AGCACACCAA CGGCAACGAA
GCCAATACCG AAGGCTTGGA AGGGCAGTCG TCTCCGCTGA TCCAGACCGC TTCGCAGTCG
AGTGTGGTTG GGTACACCAA TCTTGGCGTC GCCGCCGGGC TTGACGCGCA GTTCCCGCTG
AGCTTCATCT CCGTTCCCAC GAGCGCCACA TGGCCGTACA TGCAGCAATG GCACTTTGAT
ATCCAGCACG AAATCATGAA GGACACCGTG CTGGTTGTGG CCTACGTCGG CAGCAAGGGC
ACCCACCTCG GCCGGCAGTC GGACATCAAC CAACTTCTCC CGACGCCGCT CGCCGACAAT
CCATTTAAGG CAGGCGAGGT CATCACCTCG GATGTCTGCA ACAACATGAT GACGCCTAGC
GGCGTTGCCG TGACCGGGCA GGCAGCAACC AATCTCGCGG TCGCTTGCGG CGCTGATGCC
AACCCATTTC GTCCGTACCT GGGCATCGGC ACCATCACCC GCTTGGAGAA CGAGTCGGGC
TCCACGTATC ACGCCTTCCA ACTCGCAGCA CGTCGCAACG TTGGACAGTT ACAGTTGAAC
GTCGCTTACA CCTGGAGCCA CTCCATTGAC GACGCTTCCG ACCGCTATGA CGGGTCGTTC
GTCGATGCCT ATGATCCGCG CCTGAATCGC GCCAGTTCGA GCTTCGATAT TCGGCACATG
CTTAACGTAG GCTACGTTTG GGACATGCCG TTCTTTAAGG ACCGTGGCTG GAAGAATATC
CTGCTCGGTG GCTGGGAACT GTCTGGCATT ACCAGCTTCC AAACCGGCAC ACCGTTTAGC
GTGCCGAACG GCGGCGCTTA CGGTGACAAC GCTGGGGTCG GCAATGGCGT CGGTACCGGT
TCGTATGCGG ATGTCGTCTC GGATCCGTAC TCGAATATCC CCGGCGGAAA CGGCGCATTC
CTTGGGCCGC TCGTCGGGAA CCCGGCGGCG TTCGCACAGC CGACAGCACT TACGTTCGGA
AACTCGGGAC GCAATTACCT GCGTAACCCG GGCTACACCA ACTGGAACAT GTCGCTCTTC
AAGAACTTCA AGCTCAGCGA GCGCTTCAAT CTCCAGTTCC GAAGCGAAGC CTTCAACATC
TTCAACCACA CCGAGTGGGC TTCGGTTGGC GGCGACGCCG GCTCCGCTGC CGGCAACGGC
CTGCAGTCCT ACACCAACTC CTTCGGAGGA GACAATTTCC TGTACATCGG AGCTGCCCAT
CCGCCGCGCA TTCTGCAACT CGGTTTGAAA CTTGTCTTCT AG
 
Protein sequence
MKSFRHVAIS VFLFVVLTGI GLAQNTSQIT GSVRDSSGAA VPNAEVVVSS PERGIERPTK 
TNDAGEYAVS GIPAGSYNLK VTAQGFKSYE AKGIVLRVAQ KTRADADLQI GGTTTEVTVA
GESIGQVETQ SSDMSGVVTG KEISQLQLNG RNFTQLVTLV PGVSNQTGQD EGTVGIAGNV
SFSFNGGRTE YNNWELDGGD NMDNGSNATL NVNPSLDSIA EVKVLTSNYG AQYGRSGSGT
VEVETKSGTS SFHGDAYEFV RNDAFNAKSY FSYTEPLIPA YKKNDYGYTL GGPIFIPGHY
NESKQKSFFF WSQEWRKERV PAPFNIPVPS AAERAGDFSD QCPGNSCPHM ADGSPYPGNI
VPIDPTGSAL LALIPGANLG SGASSVYNAS PTQPTYWREE LFRIDHNIND KWHVTFRYTH
DSWNTINPTS QWTGSAFPTV QTNFVGPAIS MVGRVTTTFT PTLVNEFVMS YTTDHITFSS
TGTPNPNAWQ RPQDLAMGYL FNNGFGGKLP AITVSDPAYG GGFYEDPNGE WPEGAYNSNP
TYTFRDNLNK IIGRHNLQFG AYYVAAQKNE LSGILVNGSL GFDSTSAVST GNAFADMLTG
NIASFSQGSD NIKFYNRYKI LEPYFQDDWR VTPKLTLNLG IRLSAFGTYR EKDNHAYNWD
PKAYDPTSAP VFNADGSVSG GNIYDGLVQC GKSSVPEGCM SGHLWNWAPR VGFAWDPFGT
GKTAVRGGYG IFYEHTNGNE ANTEGLEGQS SPLIQTASQS SVVGYTNLGV AAGLDAQFPL
SFISVPTSAT WPYMQQWHFD IQHEIMKDTV LVVAYVGSKG THLGRQSDIN QLLPTPLADN
PFKAGEVITS DVCNNMMTPS GVAVTGQAAT NLAVACGADA NPFRPYLGIG TITRLENESG
STYHAFQLAA RRNVGQLQLN VAYTWSHSID DASDRYDGSF VDAYDPRLNR ASSSFDIRHM
LNVGYVWDMP FFKDRGWKNI LLGGWELSGI TSFQTGTPFS VPNGGAYGDN AGVGNGVGTG
SYADVVSDPY SNIPGGNGAF LGPLVGNPAA FAQPTALTFG NSGRNYLRNP GYTNWNMSLF
KNFKLSERFN LQFRSEAFNI FNHTEWASVG GDAGSAAGNG LQSYTNSFGG DNFLYIGAAH
PPRILQLGLK LVF
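
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and checked for GC content. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares; it does not handle the alternative start codons that table 11 allows, and the helper names are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order (amino-acid assignments shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna_text)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna_text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna_text)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First seven codons of the gene sequence above.
print(translate("ATGAAGTCTT TCCGCCACGT A"))  # MKSFRHV
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two listings agree.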