Gene Acid345_0417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0417 
Symbol 
ID4068736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp480503 
End bp484006 
Gene Length3504 bp 
Protein Length1167 aa 
Translation table11 
GC content58% 
IMG OID637982421 
ProductTonB-dependent receptor 
Protein accessionYP_589496 
Protein GI94967448 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0498154 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCTT GCAGTCGAGT GGTCGCGCTC TTGCTGCTGT GCTTTGCCGT GGTTGGCTTC 
GCCTTTGGGC AGGCTGATAC CGGTCAGGTC ACCGGCGTAG CGACCGACCC ATCCGGAGCT
ATCGTGGCCG GCGCCAAAGT CACCATCACC GACGTGAATA CCGGCGCGAC ACGTACATCT
ACGACGACCC AGAGCGGGAC GTTCGCATTC ACGAACTTGA AACCTTCTGC GTACGACGTC
GTTGTCGAAG CAGAGAAGTT TGCGCGCTTC ACCCGTCGCT TGGATGTCAG CGTCGGCTCG
CGAAATGATC TGCAGGCGCA ACTCGCAGTG ACGTCAGCCG GTACAACCGT GGACGTCACC
GCCGAAACTG AAGCGGCGCA GGTGAATACC GAATCGCAGA CGTTGTCCAA CGTGGTCACC
GCGAAGCAGA TTGCGGAATT GCCAACTCTC ACGCGCAACG CTTATGACCT CGTTGCCACT
GCCGGCAACG TGGTGGAAGA CACCACCGAC TTCGTGCGCG GCACTGGTTT CTCGATCAAC
GGCCAGCGCT CGGCTTCGAC CGATATCCTG CTCGATGGCG GCGAGAATGT CGATCTCTAT
GACGCCGCAG TTGGGCAGAA CGTCCCTCTC GATTCAGTGC AGGAATTCCG CATTGTCACC
AGCGACTTCA CGGCGGAGTA CGGCCGCGCA GGCGGCGGGG TCGTAAACGT TGCCACCAAG
TCGGGCACCA ACGCTTTCCA CGGTACCGCT TATGACTTCA ACCGGGTTTC GGCTCTCGCC
GCCAACTCTT GGGAGAACGA CGCAAACGGC ATCGACAAAG GACATTTCAC ACGCAACCAG
TTTGGCTATT CCATCGGTGG TCCGGTGGTG AAGAACAAAC TATTCTTCTT CTCCAGCACC
GAGTGGACGC GCGTTCGCAG CACCAACAAC AACATCGCGA CGATCATTGA TCCCGCGTTC
CTCGCAGCTC CGGAAGTCAG CGACAACACC AAGGCGTTCT TCCAGGCCTA CGGTGCTCGC
AAGGACAACC TGGACGTCCT CGACGTTCAA ACGTGGGCAC AAACGCCACA CACCACCAGC
ACAGGGCCGG CCGACTCAAC TCCAACCCTG GACAAAGTTT CCTACCAGAT CGCCGGCAAC
TCCGGCGGTG GCGCTCCCCA GAACTCGTAT TCAACGGTCA ATCGCGTGGA TTGGAATGCG
ACCGACAAAA CCACAATCTT CGGCCGTTAT GCTCTCGAGA GCCAGGATTA CTTCCCCGGA
TACATCAGCT TCAGCCCGTA TTCCGGCTAT GACACTCCGG ACCTTCTCTT CCGTAACAAC
GTGCTCATCA ACATGACGCA CGTGTTCAGC CCGAATTTCG TTAGCCAGAG CAAAATTGCT
TACAACCGGC TGAACGAATC GCAACCGTTC TCCACTGCGC CGGTTGGCCC GACGCTTTAC
TCCAACTATT TCGGCCTGCC CACCATCAAT GGCGGACTGT TGCTCTTCCC CGGTTACTCG
CCGAGCACTC CCGGCAACTC GATTCCATAC GGCGGTCCGC AGAACCTCTA CCAGCTTTAC
CAGGACCTCT CCTGGACAAA AGGCAAGCAC CAACTCCGCT TCGGAGGCCA GTACATTCAC
ACCCGCGACA ACCGCACCTT CGGAGCCTAT GAAGGCGCCG CCGCGTATCT GAATAATGCC
TTCGATGTCG GGGATGCCTT CGACCAACTC GTCGCAGGTA ATACCAAGCG CTACCAGGTT
GCGGTCGATC CCCAAGGAAA ATTCCCATGT CCTTACACTA CCGATGGCGC TTATCAGGAA
CTCGATTCCT GCAAAGTCTC GCTGCCGGTC AGCTCACCCT CGTTCACGCG CCACAATCAC
TACAACGACG GCGCGTGGTA TGCACAGGAC ACCTGGAAGT TCACGCCTCG GCTCACTCTG
AACCTCGGCC TACGCTGGGA ATACTACGGC GTACAGCACA ACGTGGATAC CAGCCCCGAA
TCCAACTTCT ACCTCGGACA GGGCTCAGGC ATTCTCGAAC AAATCCGCAA CGGTTCCGTT
CAGCTCGCCG GCCAGGGTCC GACAAGCGGA TTGTGGGCAC CCGACCGCAA TAACTTCGCG
CCGCGCGTTG GCTTCGCGTG GGATGTTTTC GGCGACGGGA AAACCGCGAT CCGCGGCGGC
TACGGCATCA GTTACGAACG CAACTTCGGC AACGTCACCT TCAACGTAAT TCAGAATCCT
CCGGGACAGT ACGTGATGAT CCAGGCTGCC CCGATTTCGG TGGATAACTT CGGTCCTCTC
GCACCTGGGA GCGGAGCCAA TCCGTACCTG CGGCCGGGTA GTTTACGCGC GGTAAACCAG
AACATTCCGA CCGCATACAC GGAATCCTGG AGCTTCGCAG TTGAACGCCA AGTAATAAAG
AACAGCGTTC TCGCGTTCGA ATACTCCGGG GCTCACGGCG TACATCTGTA CGACATCGCC
AATATTTCCG GCGCCGGTAT GGGATACGCC TTCTTCGGCG ATCCCGACAG CGTCCGACTC
ACTGGCACGA ACCTGCAGTA CTCGGCAATC AACTATCGCT CGGCGAACGG GTTTAATACC
TACGGCGGCT TGAATACCCG ATTCACCACC GACAACCTGT TCAACCTCGG TCTTCAGCTG
AACTTCAACT GGACCTGGTC TCACTCACTC GATAACCTCA GCAACACGTT CAGTGAGGCG
GGCAACGGCC CATTCCAACT TGGTTACGTT CACCCGTTTG ACCCAGTCCT CGACAAGGGC
AACTCCGAAT ACGACGCACG CCACCGTTTC GTCGTCAGCG CAGTTTGGCA AGTGCCGTGG
GGCAAGAAAG TAAACAACAG CGTGCTTCGG CAAGTTGTGG ACGGCTGGTC ATTGTCGCCG
CTGTTCAGCT ACCACACCGG TTATTCCTAC AGCATCTTCG ACGGAACCAA CGCCTTCAAC
AACGATGGCC GTTGGATTCC CGGAGCTGCA ACTTCCAGGG ACGGTTCTGC CAATCGCAAC
AGCTATGTTG GCGGCGGCGT CTTCAACTAC ATGGGTCTGC CGTGGGATCC CAACAATATG
ACTCTCGTGA ACCTCGGTCA AACCACTGCT CCGGGCGCCG TTGCTGGCAC GGGCGTCTCG
TGGGGCGATC TGCCGATCAA TCCGGCGACC GGCTTGTCCA GCTGTCCCAC CGTTGGCAGC
CTCGTGGGTT GCAGCTACGG TCCAAACACC GATCGTAACC AGTTCGTAGG CCCGGGGAAC
CATCAATTCA ACGCGGTCAT CGGAAAGACT TTCCGTCTCT CCGAGCGTTT CAGCATGCAG
TTCCGCGGCG AATTCTACAA CGTGTTTAAC AACCACAATT ACTACCTGCT CACTACGAAT
GCTGATGTCT TCGGCATGTC GGTAGCAGCG AATGGAGACT CGAGCCTAAG CGCGGGACGG
AACATCCAGG CTGTAAAGGG TGGCTACGGT AACGCTCTCG ACGAAACTCG CAACATCCAG
CTCGGGCTTA AGCTCATTTT CTAA
 
Protein sequence
MNSCSRVVAL LLLCFAVVGF AFGQADTGQV TGVATDPSGA IVAGAKVTIT DVNTGATRTS 
TTTQSGTFAF TNLKPSAYDV VVEAEKFARF TRRLDVSVGS RNDLQAQLAV TSAGTTVDVT
AETEAAQVNT ESQTLSNVVT AKQIAELPTL TRNAYDLVAT AGNVVEDTTD FVRGTGFSIN
GQRSASTDIL LDGGENVDLY DAAVGQNVPL DSVQEFRIVT SDFTAEYGRA GGGVVNVATK
SGTNAFHGTA YDFNRVSALA ANSWENDANG IDKGHFTRNQ FGYSIGGPVV KNKLFFFSST
EWTRVRSTNN NIATIIDPAF LAAPEVSDNT KAFFQAYGAR KDNLDVLDVQ TWAQTPHTTS
TGPADSTPTL DKVSYQIAGN SGGGAPQNSY STVNRVDWNA TDKTTIFGRY ALESQDYFPG
YISFSPYSGY DTPDLLFRNN VLINMTHVFS PNFVSQSKIA YNRLNESQPF STAPVGPTLY
SNYFGLPTIN GGLLLFPGYS PSTPGNSIPY GGPQNLYQLY QDLSWTKGKH QLRFGGQYIH
TRDNRTFGAY EGAAAYLNNA FDVGDAFDQL VAGNTKRYQV AVDPQGKFPC PYTTDGAYQE
LDSCKVSLPV SSPSFTRHNH YNDGAWYAQD TWKFTPRLTL NLGLRWEYYG VQHNVDTSPE
SNFYLGQGSG ILEQIRNGSV QLAGQGPTSG LWAPDRNNFA PRVGFAWDVF GDGKTAIRGG
YGISYERNFG NVTFNVIQNP PGQYVMIQAA PISVDNFGPL APGSGANPYL RPGSLRAVNQ
NIPTAYTESW SFAVERQVIK NSVLAFEYSG AHGVHLYDIA NISGAGMGYA FFGDPDSVRL
TGTNLQYSAI NYRSANGFNT YGGLNTRFTT DNLFNLGLQL NFNWTWSHSL DNLSNTFSEA
GNGPFQLGYV HPFDPVLDKG NSEYDARHRF VVSAVWQVPW GKKVNNSVLR QVVDGWSLSP
LFSYHTGYSY SIFDGTNAFN NDGRWIPGAA TSRDGSANRN SYVGGGVFNY MGLPWDPNNM
TLVNLGQTTA PGAVAGTGVS WGDLPINPAT GLSSCPTVGS LVGCSYGPNT DRNQFVGPGN
HQFNAVIGKT FRLSERFSMQ FRGEFYNVFN NHNYYLLTTN ADVFGMSVAA NGDSSLSAGR
NIQAVKGGYG NALDETRNIQ LGLKLIF