Gene Acid345_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0783 
Symbol 
ID4068564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp967343 
End bp970468 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content58% 
IMG OID637982790 
ProductTonB-dependent receptor 
Protein accessionYP_589862 
Protein GI94967814 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.149411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.179647 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAGAT ATATCCGTTA TGTGTTGAGC TTAGCGCTCC TGCTATGCGC GACACATATG 
TTATTCGCCC AGGCGACCGC TAGCGCGAAT GTCGCCGGCA CCGTGTATGA CAAGACGTCC
GCGGTCATCG CCGGCGCACA AGTCACCATC ACGAGCAAAG CGACCGGACA AACGCGTACT
TCCAACACCG ACAGCAACGG CGCGTACCGC TTCGACCTGT TGTCTGCCGG TCAATACAGC
ATCAAGATCA CCAAGTCCGG ATTTGCAAGC CTTTCCGAAA ACGTAGAGCT TCTGGTCGGA
CAAACTTCGA CCATCAACGG TGTCTTGAAT CCAGGCTCCG CCAGCGAAGT GGTGGAAGTG
ACCGAAGCCG CGCCGCTCGT GGATGTCGCC AAGACCTCCG TAAGCACGCA GATCACGCCC
AGCGAAGTGC AGGAAATGCC GCTTGTCGGA CGTGACGTTG CCAACCTCGC CTACCTCGCT
CCGGGCGTGA AAGCCGCGGA CTCCTACGAT CCCACCAAGA ACCGTTACGC GATTCTCTCC
GTGAACGGCG CCGATGGCCG TAACGTCAAC GTGACTGTCA ACGGCGTGGA TAACAAGGAC
AACACGGTTG GCGGACCGGT CATGCAGCTT CCGCTCGAAG CCGTGCAGGA ATTCCAGATC
AGCACCCAGC GTTTCTCGGC TGAAAACGGC CGTTCGCAGG GCGCCGCCAT CAACATGATC
ACCAAGTCGG GTACCAACAT GTACCACGGC TCCGCCTTCG GCTTCTTCCG TACCTCGGCC
CTCGACGCCG ACGAAATGGT TCCGGATGGC ACCGGTGGCG CCTTCCACTC ACACCCCGAC
TACAGCCGTC AGCAGTTCGG CGGCTCCTTC GGCGGCCCGA TTATCAAGGA CAAGCTCTTC
GGCTTCTTCG CACTCGAACA TGAGCGTGAA CGCCAGGGCC TCAGCGAGAG CGGCACTTCG
TTCGACGAAC TCTCCCTCGC GGCGGGTGCT GGACTCGCAG CTCAGCCGTC AGCAGTCATT
CCGCGTCCGT TCAACGAAAC TCGCTATAGC GGCCGTCTCG ACTGGAACGT CAACAGCAAG
AACTCCGCGT ACCTTTCCTA CAACTCGCAG GTGAACGACA GCCTGAACGA TCAGTCGGAC
GGTACCGGCG ACCTGACCAA CGGTAACTTC ACCAAGAACC ATCTGCAGTT GGCCAACTTG
ACTTGGAATA CCTTGCTAAC GTCGCACCTG ATCAACCAGT TCACGTTCGG GTGGCAGTAC
TGGAACAACC TGATCGACAG CGACATCAGC GCGCCGCTGG TCACGTTCCC GAACGCTTCG
TTCGGCACCA ACACCAACGT TCCGCAGCAG TCGTTCCAGC GCAAGTTCCA GTTTAAGGAC
GACATCAGCT GGACCCACGG AAAGCACACC TTCAAAGGTG GCGTGGACTA CATCTGGAAT
CCAGTGGAGG GCGGCTTCTT CGAGTACAGC TCAACTCTCG AAATCGATTT CGGCGCCGAC
CCGAGCTGCA TTTTGGCCGC CGCTACCGAC GACGTCAACA AGTGCGGTCC GGGATATTAT
CCGCAACAGT TCGCCACCAA GGGCGCTGTC ACCGGCATGA CCATCGCCAA CGGCGATCCG
CAGTTCATCG TGCCGACCAA GCAGCTCGGC TTCTACTTCC AGGATGATTG GAAGGTCACG
CCGCGATTGA ACTTGAACCT CGGCATCCGT TGGGACAAGG ACTTCAACAC CTACGGTCAG
TCCGACATCA CGAACAGCCG TACCTACCAG GAATTGGTGG CGATCAACAG CCCGATCACC
AATCCGTATG TCGCGAGCCT CCCGCACGCC AGCAACAAGG ACTTCAGCCC GCGTGTTGGC
TTCGCCTATG ACCTCACCGG TTCCGGCACG CACGTCCTGC GCGGCGGCTT CGGTCTCTAC
TACGGCAACT CCTTCCAGAA CATTCCGTTG TTCATGGAAC AGCAGGCCAA CTCGACGATC
TTCCAGACCC TGTTCAGTTT GAGCGATCCG GTGAACGATG TCGTTCCAGG CACCGGAATT
CCTCTCGGAC AATGGCAGTA TGGAATCAGC CCGATGCCCA CAATTGCCCC GCCGTCTGCT
GACCTCACTG TAGGCAGCAC CGGGCGATTG GTCGATCCTA ACTACCAGAA CCCGGTGTCG
GAGGAATTTA ACTTCGGCTA CTCTTGGGGA GTAACCTCCA ACTCGGTGTT CGAGACGGAG
TTCACGCACG TCCAAAACCT GCATGAAAAC CGCACCATGA ATATCGACCA GAAGGTTCCG
GTCGGTGGTG TTTGCTGCTT CCGTCCTCTC GACGATGCCT TTGCCGCTGC CGGCCAACCG
CGCTTGAACA GTGTGCGCGA TGAGCAGTCC ATCGGCCGAT CCCGCTACGA CGGCATCAAC
TTCGGCTACC GCCAGCGCAT GACGCATCAC TTCATGTTCA ACGCGTACTA CACCCTGGCC
TGGGCCGACG GTTACAACAC CAACGGCAAC TATGCCTTCC GCAACTACCC GTTCCTCGCA
ACCGATCCGT TTGCCAAATA CAACTGGGGA CCGACTTATT CCGATGAACG TCACCACGTG
ACCATCAGCG GTTTGGTTGA TTTCCCCTTT GGCATCCAGG CTTCGCCGAT TTTGCAATAT
GGCTCGGCGC GCCCGTACGC TCTGACCAAC TCTTACAACA CGCTCAACAC CGGTAACGGC
ACGGCGACCG CCGTAATCGT CCCCAAGGGA CAGACGAATA ACTACCTCTA CGGCACGAAC
TACATCGCGT CGTACGTTGC CGCTCATCCG GGCGACGACA ATGCAACCTC GGAAGCGCAG
CAGAACCTCC AGATGTGCTT CTACAACGGC GATTGCACCC TCGCGCAGTT CGATCCGCTT
CGTGGCAAGC CGACCTTCGA GCTTGACCTC CGCTTGGCGA AGAACTTCAA GCTGGGTGAG
CGCTTCAACC TGCAGATCAC CGCACAGGCG TTTAACCTGA CCAACGCCCC GAACTACGGA
AACAACTTCA ACGGCAACAT CGCCTCGCCC TCCACGTTCA TGCATCCGGC GGGCTTCATC
AACCCCAGCA GCACCACGAC TCCGCGTTCG CTGTGGAGTG AATACGGAGT ACGCCTCACG
TTCTAA
 
Protein sequence
MRRYIRYVLS LALLLCATHM LFAQATASAN VAGTVYDKTS AVIAGAQVTI TSKATGQTRT 
SNTDSNGAYR FDLLSAGQYS IKITKSGFAS LSENVELLVG QTSTINGVLN PGSASEVVEV
TEAAPLVDVA KTSVSTQITP SEVQEMPLVG RDVANLAYLA PGVKAADSYD PTKNRYAILS
VNGADGRNVN VTVNGVDNKD NTVGGPVMQL PLEAVQEFQI STQRFSAENG RSQGAAINMI
TKSGTNMYHG SAFGFFRTSA LDADEMVPDG TGGAFHSHPD YSRQQFGGSF GGPIIKDKLF
GFFALEHERE RQGLSESGTS FDELSLAAGA GLAAQPSAVI PRPFNETRYS GRLDWNVNSK
NSAYLSYNSQ VNDSLNDQSD GTGDLTNGNF TKNHLQLANL TWNTLLTSHL INQFTFGWQY
WNNLIDSDIS APLVTFPNAS FGTNTNVPQQ SFQRKFQFKD DISWTHGKHT FKGGVDYIWN
PVEGGFFEYS STLEIDFGAD PSCILAAATD DVNKCGPGYY PQQFATKGAV TGMTIANGDP
QFIVPTKQLG FYFQDDWKVT PRLNLNLGIR WDKDFNTYGQ SDITNSRTYQ ELVAINSPIT
NPYVASLPHA SNKDFSPRVG FAYDLTGSGT HVLRGGFGLY YGNSFQNIPL FMEQQANSTI
FQTLFSLSDP VNDVVPGTGI PLGQWQYGIS PMPTIAPPSA DLTVGSTGRL VDPNYQNPVS
EEFNFGYSWG VTSNSVFETE FTHVQNLHEN RTMNIDQKVP VGGVCCFRPL DDAFAAAGQP
RLNSVRDEQS IGRSRYDGIN FGYRQRMTHH FMFNAYYTLA WADGYNTNGN YAFRNYPFLA
TDPFAKYNWG PTYSDERHHV TISGLVDFPF GIQASPILQY GSARPYALTN SYNTLNTGNG
TATAVIVPKG QTNNYLYGTN YIASYVAAHP GDDNATSEAQ QNLQMCFYNG DCTLAQFDPL
RGKPTFELDL RLAKNFKLGE RFNLQITAQA FNLTNAPNYG NNFNGNIASP STFMHPAGFI
NPSSTTTPRS LWSEYGVRLT F