Gene Acid345_4190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4190 
Symbol 
ID4072149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4959572 
End bp4962916 
Gene Length3345 bp 
Protein Length1114 aa 
Translation table11 
GC content57% 
IMG OID637986221 
ProductTonB-dependent receptor 
Protein accessionYP_593264 
Protein GI94971216 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.351423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCAC AATTTGGTAA GCGAAGAGCG GCATTCGTTA GTGTTCTGCT ATTCGCACTT 
ATCTTTGCTG TTTCTGCGTG GGGCCAAACG AGTAAAGGAA TCATCGCTGG AACAGTCACA
GATACGAGCG GAGCCGTTGT CGCCGGAGCA CAGGTAACAG CGTCAAACAC GGATACCGGC
GAAACACGCA CCGTTCAGTC GGGACCGACG GGTGCGTACC GCGTTGAGGC AGTCACACTC
GGCAAATATC GCATCAACGT ATCGTTCCAG GGTTTTCAGA GCCAGGTTAT AAATGGCGTT
GAGGTTGCCG GATCGGTGAT GACTGCGGTT GATATCAAAC TGCAAGTCGC CAGCGCCGCC
AGTACAGAAG TGACGGTATC GGCGGACAAC AACCAGGTGC AGACCGAAAA TGGCGAGATC
TCCAGCACGA TCACTACGAA AGAATTGACC AATCTACCGC TCGCCTCCCT GAACCCGATC
GAACTAGCAC TGACGCAGCC CGGCGTTATA GACAATGCCG GCCGTGGAAG CACGAACGGC
ATGGGTTTCT CGGTGAACGG TTCACGTCCA CAGAGCAACA ACTTCCTGAT TGATGGTCAG
GACAATAACG ACAACAGCAT TCAGGGCCAG GCGTATCAGC CACAGAATCC GAACGCAGTG
CAGGAAGTCG CCATCATGAC GAACTCCTAC TCCGCGGAAT TCGGTCGTGG CGGAGCTTCG
GTCACCAACG TTATTTACAA GAGCGGCTCC AATCAGTTCC ACGGAACGCT GAGCGAACTC
TACTCGGGCT CCGGATTGAA CGCGATCGAC GCCGCCAACG GCCTTACTGG AATGAAGGAC
GGCGGCGATT GCAACCGAGC GCCGAATTTC TTCCCATGTA AGACGCGTTT CGACTCCCAC
ACCTTCGGAT TTACGTTCGG CGGTCCGCTC ATTAAGGACA AGCTGTTCTT CTTCGGCAGC
GGAAACTGGG AGCGCACCTA CGGTCAGGAG CAGAACAGCA ACATTCTGAT CCCCACTGAA
ACCGGCATCG GCCAGTTGCA GGCGTATGGA TCGGCGAACG CGACTCTTCT GACCCAATAC
CTCGGATCTC TCCGCGGATC TACGGTTGGA GCGACGTGCG TCAACACTGG CATTTCGTCG
TTGCCTTGCG TAGAGATGGG CAATTACACG CCGACCGCGC CGCAGCAAAA CACTGACACG
CAGTGGAACA TCAAGGCGGA CTATCTGCCG CGGCAGACCG ACACCATTAC CTTCAACTAT
CTCCACGACC GCGGCTACTT CGCGCCTGAC TGGTTCGCGA ATCCGGGTTC CGTATTGCCA
GGCTTCGAAA CCTTCCAGGG CGGACCGTCG TGGATCGCGG GCGCTTCGTG GACGCACACC
TTTAGCTCGA ACAAGGTGAA CGAATTCCGC GCGTCCTACG GACACCTCGG CTTCACGTTC
GGACCTACGG CCGCAACTAC AGGGAATCCG CTTTATCTAA TGCCGAGCCT GGCACTGAGC
AGCGCCGGAT TGGATGCATT CCCGCTATTG GGTACTGACT CAGCTTTCCC GCAGGGTCGT
AGCCACCGCA CCATGCAGTT GCAGGATGGC TTCACCATTA CGAAGGGCAG CCACACCTTC
AAGATGGGCG CGGACGTCGC GCGCATCTGG GTGACGGACC AGATTCCAAT CAACACCCGC
GGAACTCTGA CCTTCACCGA TGGTGGCGGC TACACCGCAC TCGGCAACTT CCTCGATAAT
TTCACCGGCA CCAGCGGCCA GGCGTTGGAT CTCCAGACAG GTAATCCTAC GGTGAAGCCC
ACGTTGCTGC AGAGCGGCTA CTACTTCCAG GACAACTGGA AGGTGCGGCC GAACCTGACG
TTGAACCTCG GCGTCCGTTA CGAGTATCAA ACGAATCCCG AGAACTCACT CCCGTACCCG
GCGGTGTCGA ACATCTACGG CGGCGATAAC AATTTCCCGA CGGTGGTGAA GGCGGACCAG
CAGTTCTCGC ACATCGCTCC TCGCATCGGG TTTGCCTACT CGCCAAACTT CCTGCCAAGC
ATCTTCGGAA ACGGCAAGAC CGTTCTCCGC GGCGGCTTCG GCATTTTCTA TGATGCGATC
TACACCAACA TTCTCGATAA TACGGCCTCG TCCGCACCGA ACTCGATCGA CGAGCCGCTG
TTTGGCTCGG ACTCTTCGAA CGCGCGCGGT TTCGCCAACG CAACCGGACA GTTCAGCGGA
CTCTCAGGTA CCTTGAGCCC GTTCAATACC GTTACAAGCA TTGCGAAGGA CCTCACCAAT
CCGCGTACCA CGCAGTGGAA CTTCAACATT GAGCGTGAAC TGCCGGCAGA CATGCTCCTG
ACGGTCGCCT ACCTGGGATC GCGTGGACAG AAACTGTTGG TGAATGACGA CTACAACCCC
TTCGGCGGCT ACGATGCCAG CGGCAACTAT ATTCCGCGCT TCAATTCGAA CCGCGGTGCC
ATGGCAATTC GCACCAATGG CGGTGATTCC TACTACAACG GCCTGGCAGT TACGGTCGAG
CGCAGGTTCT CTCATGGCTT GATGCTGCGC AGCGCATATA CGTTCTCGAA GTCCATTGAT
GACAGCTCGA ATATTTTCGT CATCACGGGC GGATCGTCGT ATGCGCAGGA CCCAACCAAC
CGCCAGGCGG ATCGTGGTTT GTCAGCCTTC AACGCGTTCC ATCGTTGGGC GTTTACCTAC
GTGTGGGACG TACCGGGCTT CAAGGCGCAG GACAACAAAG TGCTGAACGG CCTCGCGTAC
CTCTCCCGTC ATTGGGAATG GACCGGGACC ACGACCCTGC AGTCTGGCTT GCCTGACACG
ATCTACGATA GCTTCGATAG CAGCGGTCGC GGCCACAGTT CGAGTGGACG TCCGGACCTG
TTGAACGCTT CGGCGCCCAT GAATGCGATT GCTTTGCCGG CAGCGTGGGG AGCTTGCGCA
CCGGGCGCGG ATTACTGCGA CCTGGCGACG CTGAACTCGC TCTCTGCCAG CGACCTGAGC
AACTATCATT TCCTGGTTCC GTTTGGCGCG CCGGGTACGG TGGGACGCAA CAACTACATC
CTGCCTGGCC AGGTGAACTT TAACTTCGGA ATCAACCGCA ACATCCCGAT TCCGCGTCAC
GAATCGCAAG TGGTGACGCT GCGCGTGGAA ATGTATAACC CGTTCAATCA CCCGAACCAG
TCGGCACTTC CGAACCAGGG CATGTGGAGC ACGAACGTCT CTGACATGTT CCTGGTTGGA
CAGGACTCGA GTGGGAACCT CAACAACCCG AGCCACCTGT TCGACACCTA CTGGGCACGC
CAGGGTGCGC GCAATATCAA GCTGTTGATC AAGTACCAGT TCTAG
 
Protein sequence
MQPQFGKRRA AFVSVLLFAL IFAVSAWGQT SKGIIAGTVT DTSGAVVAGA QVTASNTDTG 
ETRTVQSGPT GAYRVEAVTL GKYRINVSFQ GFQSQVINGV EVAGSVMTAV DIKLQVASAA
STEVTVSADN NQVQTENGEI SSTITTKELT NLPLASLNPI ELALTQPGVI DNAGRGSTNG
MGFSVNGSRP QSNNFLIDGQ DNNDNSIQGQ AYQPQNPNAV QEVAIMTNSY SAEFGRGGAS
VTNVIYKSGS NQFHGTLSEL YSGSGLNAID AANGLTGMKD GGDCNRAPNF FPCKTRFDSH
TFGFTFGGPL IKDKLFFFGS GNWERTYGQE QNSNILIPTE TGIGQLQAYG SANATLLTQY
LGSLRGSTVG ATCVNTGISS LPCVEMGNYT PTAPQQNTDT QWNIKADYLP RQTDTITFNY
LHDRGYFAPD WFANPGSVLP GFETFQGGPS WIAGASWTHT FSSNKVNEFR ASYGHLGFTF
GPTAATTGNP LYLMPSLALS SAGLDAFPLL GTDSAFPQGR SHRTMQLQDG FTITKGSHTF
KMGADVARIW VTDQIPINTR GTLTFTDGGG YTALGNFLDN FTGTSGQALD LQTGNPTVKP
TLLQSGYYFQ DNWKVRPNLT LNLGVRYEYQ TNPENSLPYP AVSNIYGGDN NFPTVVKADQ
QFSHIAPRIG FAYSPNFLPS IFGNGKTVLR GGFGIFYDAI YTNILDNTAS SAPNSIDEPL
FGSDSSNARG FANATGQFSG LSGTLSPFNT VTSIAKDLTN PRTTQWNFNI ERELPADMLL
TVAYLGSRGQ KLLVNDDYNP FGGYDASGNY IPRFNSNRGA MAIRTNGGDS YYNGLAVTVE
RRFSHGLMLR SAYTFSKSID DSSNIFVITG GSSYAQDPTN RQADRGLSAF NAFHRWAFTY
VWDVPGFKAQ DNKVLNGLAY LSRHWEWTGT TTLQSGLPDT IYDSFDSSGR GHSSSGRPDL
LNASAPMNAI ALPAAWGACA PGADYCDLAT LNSLSASDLS NYHFLVPFGA PGTVGRNNYI
LPGQVNFNFG INRNIPIPRH ESQVVTLRVE MYNPFNHPNQ SALPNQGMWS TNVSDMFLVG
QDSSGNLNNP SHLFDTYWAR QGARNIKLLI KYQF