Gene Information |
Locus tag | Acid345_1678 |
Symbol | |
ID | 4069346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2029530 |
End bp | 2032490 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983686 |
Product | hypothetical protein |
Protein accession | YP_590753 |
Protein GI | 94968705 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
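The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2029530
end_bp = 2032490
gene_length_bp = 2961
protein_length_aa = 986


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus the terminal stop codon, for an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # coordinates vs. Gene Length
print(computed_protein_length == protein_length_aa)  # Gene Length vs. Protein Length
```

Under those assumptions, 2032490 - 2029530 + 1 gives 2961 bp, and 2961 / 3 - 1 gives 986 aa, matching the table.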
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.675955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGAAAA TTCAGGTGCT GGTTTTCGCG CTGCTATGCG CGCTTGCGTC GTTTGGTCAA GAATTGGGCA CTGCTACCCT GACGGGGACG GTTGTCGATC CGTCGGGAGC AGTGATTGGG GGCGCAAAGG TCAGCGCGCA GAACAAGGCC ACTGGGATGC AACGAGAGGC TCTGAGCACG AAGGCAGGGA TTTTCGTGTT TAACGATCTT GCTCCGGGTG AGTATGACCT GACGATAAAG GCGGATGGAT TTGCCGTGAC CACCGCGACA GTGCGCATAA CTGTAGGTCA ACAGGCGAAC CTGCGCGCGG AAGTGCGGGT GGCGCAGTCG GAACAACATA TTGACGTTTC GGGCGCGATT CCGCTGGTGG AGACGAGCTC GTCCGTGGTG GACGGCGTGG TGGATTCGAA GCAGATTGAT GCCCTCCCCC TGAATGGCCG TAACTTCCTG GAACTCGCAT TGCTGATGCC GGGGAATGCG CCGGCGCCGC TGTATGACCC GACAAAGTCA GACACCGTTT TGGTTTCTTC GACGGGACAG CTTGGACGCG GTCAGAACAT CACGATTGAT GCGTCAGACA ACAACGATGA TGTGGTGGGC GGAATGCTGG TGAACATTCC GCAGGATGCA GTGCAGGAAT TTCAGATTGC GACGAACCGG TTCTCGGCGG AATTGGGGCG ATCGACTTCG TCGGTGGTGA ACATCCTTAC GAAATCGGGC ACCAACGATC TGCACGGGAC CGCAGCGATC TTTGCACGCG ATGCGGCACT GCAGGCAAAA CCCAATTCGT TTGGACAGAG TGTTGGAGAT ACGCCGCCAT TCAGGCGCGA TCAGTACACC GGCTCCATCG GCGGACCTGT GGTGAAGGAC AAGGCGTGGT GGTTTACGTC TTTCGAATAT CGCGACCAGA TGGGCGGCAT TTTGGTGGGC GAGCGCGATC TGGCAACGCA GTCTATTCAC ACGACCTTTG CTGCTGCGCC GCTGACGGAT GCGTTGGGGA CAGCGAAGTT TGACTGGGCG ATTTCGTCGA AGGACACGCT CTCGACGCGC TACTCCACCG AGAGCTTCGA AGGGACCTCC AATAGCGCGA CCGACCGCGC GCTCGGCACG GCGTCGCAGA CGCAGGCTTC GACGAACCGC TTCCACGACA TCAACACGAA CTGGACGCGC GTGATTTCGG CTGCATTGGT GAATCGCGCG CAATTCGCCG TGAACCTCTT CCAGAACGAC ACGGTGGCGA ACGGTACCGG GCCGCAGATT AATTTCCCGA GTATCGAGGC GGGTTCGTCG TATCGCGTGC CGCAGGCGAC GCACCAGCAA CGCTTGTTGT GGGGGGACAC GCTGGATTGG ACCCACGGCA AGCACAATCT TAAGTTTGGT GCGCAGATGC AGCGCGTGGG ATCCGACTTT AACCTTGGCG TCTTCCAGCA GGGAATTGTG AACGCCGTGG AAGATTTTCC AGACTTCGAT CGCAATGGCG ATGGAGTGGT GAACGACAAC GATCTGCTGT TTGCAGTAGG GCTGGTCAGC CATACGCCGA CGCGCCCGCT GATTATTCCT GATGCCGACA ACAACTATGT TGCGTTGTTC GCGCAGGACG ACTGGCGGGT GCATCCGCAG TTGACGTTGA ACGTCGGTCT GCGATGGGAG CTCGATACCG ACGTGAAGAA CGTTGGACAT TATGACGAGA TCAATCCGCT GGCAAAGCCG TTCCTGCACG GCGATCGTTC GGCGGACTAT ACGAACTTCG GGCCACGCAT TGGTTTCAAC TGGGCCAACA AGCCGGGGAC GTTGAGCGTG 
CATGGCGGAT ACGGGATGTA CTACGACCGC ATCGTGCTGG AAATTGTGTC GTTGGAGCGC GGCGAAGATG GTCGCGCGCT GGCGATTGAT GTGCATGCAG GAAATGTCTT TCCGGGATAT ATGAATCCGG ATGGGACCTT CATCCCGGGT GTGACGCCGA CACTCGCGGA TCCGTTTACG GGATTCATTC TTCCCGGCGC GGGCGCGGGT GGAATTTACG CCATCAGCAA CCAGATGCAG AACCCGATGG TGCAGCAATT CAATCTCGGA GTGCAGTGGG AGTTCCTGAA GAACTGGGTT GTACGTGCCG ATGGAATGCA CGACTTTGGA CAACACTTCA TCATCGGCGT GCCAGTGGGC ACGGTTTACA ACCCTGTGGT TGGTGGACCC GACACGGTGA AGATTCTTGA GTCGGCCGTG AACACGCATT ACGACGCGCT GTTCCTAACC GTGGACCATA AGTTCTCGAA CCACTTCAAC CTGCATTCGG CTTATACGCT TTCGAAGTCG TTGAACTATG CCAACGACGA CCAGATTCCG TTTGCGAATG GGCCGATTGA TCCGACAGAT CTGCATCGCG AATACGGACC GACGCCGAAT GACCAGCGGC ATCGCTGGGT AACGGCCGCG ACGGTTTCGT TGCCGTATGG AATCCAGTTC TCGCCGTTGT GGACGCTGGC TTCGGGTGTG CCGATGGATA TCCAGCTTCC CGATGGAAGT TCACGTGTGC CGGAGATGCA ACGCAACGCG GGTGGGCGCG AGTTCCACAA TGCAGCGGAG CTGAATGCGT TCATCACGCA GTTGAATGCG GCGGGCGGAT CGAACGGAAC GTTGCTTCCG CTGGTGAGTC CGAATGCCAA GTTTGGCGAT TCGTTCAACT CGTTCGATAT GCGGTTGTCG AAGACGTTCC GGTTGGGAGA CAGGATGTCG CTCGAAGTGC TGGGCGAATG CTTTAACGTC TTCAACACGA CGAACGTGCT GGGCGTATCG AATACGAATT ACTCGGGATA CAACAACGTG TTGGTGCGCG ATAGTAACGA TCCGACCAGC GCTGGATACC TCACGTCGTC GACGTTCGGC ATGCCGAAGA CAACGGCGGG TGGGGTGTTT GGCTCGGGTG GCGCACGGGC CTTCCAGTTG GCAGCGCGGT TTAACTTCTA G
|
Protein sequence | MKKIQVLVFA LLCALASFGQ ELGTATLTGT VVDPSGAVIG GAKVSAQNKA TGMQREALST KAGIFVFNDL APGEYDLTIK ADGFAVTTAT VRITVGQQAN LRAEVRVAQS EQHIDVSGAI PLVETSSSVV DGVVDSKQID ALPLNGRNFL ELALLMPGNA PAPLYDPTKS DTVLVSSTGQ LGRGQNITID ASDNNDDVVG GMLVNIPQDA VQEFQIATNR FSAELGRSTS SVVNILTKSG TNDLHGTAAI FARDAALQAK PNSFGQSVGD TPPFRRDQYT GSIGGPVVKD KAWWFTSFEY RDQMGGILVG ERDLATQSIH TTFAAAPLTD ALGTAKFDWA ISSKDTLSTR YSTESFEGTS NSATDRALGT ASQTQASTNR FHDINTNWTR VISAALVNRA QFAVNLFQND TVANGTGPQI NFPSIEAGSS YRVPQATHQQ RLLWGDTLDW THGKHNLKFG AQMQRVGSDF NLGVFQQGIV NAVEDFPDFD RNGDGVVNDN DLLFAVGLVS HTPTRPLIIP DADNNYVALF AQDDWRVHPQ LTLNVGLRWE LDTDVKNVGH YDEINPLAKP FLHGDRSADY TNFGPRIGFN WANKPGTLSV HGGYGMYYDR IVLEIVSLER GEDGRALAID VHAGNVFPGY MNPDGTFIPG VTPTLADPFT GFILPGAGAG GIYAISNQMQ NPMVQQFNLG VQWEFLKNWV VRADGMHDFG QHFIIGVPVG TVYNPVVGGP DTVKILESAV NTHYDALFLT VDHKFSNHFN LHSAYTLSKS LNYANDDQIP FANGPIDPTD LHREYGPTPN DQRHRWVTAA TVSLPYGIQF SPLWTLASGV PMDIQLPDGS SRVPEMQRNA GGREFHNAAE LNAFITQLNA AGGSNGTLLP LVSPNAKFGD SFNSFDMRLS KTFRLGDRMS LEVLGECFNV FNTTNVLGVS NTNYSGYNNV LVRDSNDPTS AGYLTSSTFG MPKTTAGGVF GSGGARAFQL AARFNF
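The gene sequence is displayed in space-separated blocks of 10 bases and begins with TTG rather than ATG. The following is a minimal sketch, not the database's own tooling, of how such a listing might be cleaned and translated. It assumes that translation table 11 uses the standard codon assignments and additionally accepts TTG, CTG, ATT, ATC, ATA, ATG and GTG as initiation codons, each read as M at the first position. The helper names (`clean`, `translate_cds`) are hypothetical, and only the first 30 and last 21 bases of the gene are used.

```python
from itertools import product

# Standard codon assignments, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str, first_codon_is_start: bool = True) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three blocks of the gene sequence shown above.
head = "TTGAAGAAAA TTCAGGTGCT GGTTTTCGCG"
tail = "GCAGCGCGGT TTAACTTCTA G"

n_terminus = translate_cds(head)
c_terminus = translate_cds(tail, first_codon_is_start=False)
print(n_terminus, c_terminus)
```

The two fragments translate to the first ten and last six residues of the listed protein sequence, and the final codon TAG is the stop.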