Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0784 |
Symbol | |
ID | 4068565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 970813 |
End bp | 974040 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982791 |
Product | hypothetical protein |
Protein accession | YP_589863 |
Protein GI | 94967815 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.410737 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.275915 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAG CTGTTCTCTT CACCTTCCTC ATCCTGCTGG TGATCTCTTC GATCGCTTTC GGCCAGCAAA CTACCGGACA AATCAACGGC ACCCTCGTAG ATTCCAGCAG TGCAGTCGTT CCCAATGCGA CCGTTACCGC AAAGAACGTC GATAACGGAC TGACCCGCTC CACCAAGTCA AGCACCACCG GCGGGTACAC CATAAATGAC CTTCCCCCTG GAACGTACAC CATCACCACG GAAGCCCCTG GTTTTGCCAA GACCGTGAAT GAGCGCGTTC CGCTGCTCGT CGGCCAGGCC TTGACGCTGA ACTTCACCCT CAAAACGGGC GGCGCCAATG AGACCATCAC CGTCACCGAG GAAGCGCCGC TTATCGAATC GACGCGTTCC GATATCGGCG GCTCAGTATC GCCGTTGGAA GTGAAGGAAC TGCCAATCGT GGACCGCAAC TTCGCGGGTT TGATGGCAAC CGTCCCCGGC GTGCGTCCTG CAGAAGCCTT CGATCCGACC AAGACCCGCT CCGGCAACGT GAGCGTGAAC GGCAGCGACG GCCGCTCCAT TGACTACAAC GTGGATGGCG GCGACAACAA GGACGTCGTC ATCGGCGGCA TCGTCCAGAA CTTCACCATG GAAGGCATCC AGGAATTCCA GGTGACGACC GACCGTTATA CGGCGGAATC CGGCCGCGCA GCGGCTGCTG TAGTGAACGT GATCAGCAAG AGCGGTACCA ACGCTTTCCA CGGGACGGCG TTCAGCCTGT TCCAAAACAG CGGTCTGAAC AGCAACAGCT ACTTCAATGA AATCGCCGGA AACCCGAAAA ACAAATTCCA TCGCTACCAG TTTGGTGGTT CTGCCGGCGG TCCAATCATC AAGGACAAGC TGTTTTTCTT CGGCGCATAC GAGCAGAAGC GCGAACCCCA GGACATTGGC GTCGATCCTT CTGCATTCGA CAATCTCACC CTGTTCGCGG CTGCATTCCC GGACTATGCG GTTCCCATCA GGAAGCTCGA CTACTCGTAC CTCGACCAGC AGCTGACAGC GAAGGTGGAC CATCGCATCA GCGATCGTCA GAACATGTTC TATCGCTATG CGTGGGAAAA ATGGACGAAC CCCAACGATC AGCTTGGATT TCCGTTCGTG GCTGACGCCA GCCAATCCAC TTCTGACAGC AACAGTTTTC ATGACTTTGT AGCGCAGCAC AACTACACGA TCTCGCCGAC CAAGGTGAAT TCGTTCAACT TCCACCTCCA GGACTTCACC AATGACATCC TTCCGGCGCC GGGTCGCACC TTCACCTACG ACGTGGCGGG AGGAGGGACT GCAACCAATC CAGAAATCTG CTTCGGTATC GGCGGCGGAT GCGGCGGTGG CGTGCCGGAA GTCGGCAACA ACGTCAACGT TCCCCAAGAA ACCCTGATTC GCAAATACCA GTTCCGCGAC GACTTCACGT GGGTGCATCG CAACCACAAT ATGAAGATGG GCGTGAACTG GGATTACGTG GACGTAATGG GCGGATTCTT CTACTTCGGC GCCAACGGCT ACCAAGTCAT CTTCCAGGAT GATCCGAAGA CGATCCTGGC GAACCCTGCG GCGTATCCGG ACGGCTTCTC AACCCGCGGC GCCGTCGGTG AACTCACCTA CAACGGCGGC TCCGGTTCGA CCGCACAGCC TCCATCGCAC CAGTTAGCGT TCTACTTCCA GGATGATTGG AAGGTCACCA ACCGCTTCAC GCTGAACGCC GGCGTGCGCT GGGACGCGAA CCCGCTGTTC CTCATCCCGC AGCTCACGAA CAACTTTAGC AGCACGAACC GCACTGTCCG AGTCCTGCGC GACGTACTCG CCGCGAACCT GTCTGATCCA GCGGCACAAG CAGGCGTTCA AAGGGCGGAC TACCTCGCCG GCAATACCAG TCTGGCTAGC AAGAACACCG CCGACTGGAA GGAATTCCAG CCTCGTATTG GTTTCTCCTG GGATCCCACC GGTTCGGGCA AGAGCGTCAT CCGCGGCGGC TATGGCATCG CACGCGACAC CATCTTCCAG AACCTGACGC TCTTCGCGGT TCAGGAAACC AATCCAACCA TTTACAACAC GATCATCGAC TACTTCCCGA GCCAAGCCCC GGGGTCTTGC CCTGCAGGCG GCACTGCCGA TCCAACCGAC CTTTGCAACT TCCGCTTTGG CATCGATCCG CTTCCGGCTC CGCAAGCGGC GACTACGGAC CTCGCACCCG GCGCCGTTGG CCGTATGCAG GACCCGCGTT TGACGGACCC GTGGTCGCAG CAGATGTCCA TCGGCGGCGA ACGCCAGTTC GGCAACGACT ACGCGTTCGG CGCGGATTAC TACCACGTGC TCGGCACCCA TGAACCACGC GTTCTGAACA TGAACCCGAA GATCGGATCG ACCTGTGATC CGGCGTACGG CGGCGATCCT ACCAACCCAA CCTGCGTGAA CGGCGCGGGA ACTCGGTTGA TGGACGCGGC CTTCTCGGAA GCACCAGACA GCCAGATCGC CGGCCAGAAC CTTGGCATCG GGCGATTGGG CGCTATCTAC GATTACTCAA CCTCGAACCG CTCTCTGTAT GACGGCATCA ACTTCCAATT ACGGAAGCGG ATGAGCCACC ACTTCCAATT CCAGGCGAGC TACGTTCTCT CCTGGGCGCG GTCGTGGGGT GGACGCCCGA CGTCGTCCTA TAGCGGCAGC GGCGTCAATG TCACTCCGGA GCAGCAGTTC GCTTCCAACG AATTCAACTA CAGCAGCTTT GACGAGCGCC ATCGCTTCAC GTTGAGCGGC GTCTTCCAAC TGCCGTGGGG ATTCGAAGTT GCACCACTGG TTCAGGCCGC ATCGGCACGC CCGTATGATT TCATCGCTGG TTCGGACATT AACGGCGACG GACGTTCCAC GATTGACCGC GCTTGCGTCG GTAGCACTCC GGGCAATCCG ATCTTCACTA AGGGTTGCAC CATGCTCAAG CCGGACACCC TGCGTGGCGA CCCGTTCTTC CAGATTGATA CGCGCGTCGC TAAAGCATTC AAGTTCAACG AACACATGAC GTTGCAGTTG ATTTGGGAGT TCTACAACAT CGGCAACGTA AACAACTTCT GTAACTACTA CTTCAATAAC GCCAGCCAGT CCAACTTTGG AACGCCGCAG GGATACTGCG GTGGCCAGGG CGGTCCGGCC TTTACGGGCC CGTTCCGTCA GCAGTTCGGC TTCCGTTTCG AGTTCTAG
|
Protein sequence | MKRAVLFTFL ILLVISSIAF GQQTTGQING TLVDSSSAVV PNATVTAKNV DNGLTRSTKS STTGGYTIND LPPGTYTITT EAPGFAKTVN ERVPLLVGQA LTLNFTLKTG GANETITVTE EAPLIESTRS DIGGSVSPLE VKELPIVDRN FAGLMATVPG VRPAEAFDPT KTRSGNVSVN GSDGRSIDYN VDGGDNKDVV IGGIVQNFTM EGIQEFQVTT DRYTAESGRA AAAVVNVISK SGTNAFHGTA FSLFQNSGLN SNSYFNEIAG NPKNKFHRYQ FGGSAGGPII KDKLFFFGAY EQKREPQDIG VDPSAFDNLT LFAAAFPDYA VPIRKLDYSY LDQQLTAKVD HRISDRQNMF YRYAWEKWTN PNDQLGFPFV ADASQSTSDS NSFHDFVAQH NYTISPTKVN SFNFHLQDFT NDILPAPGRT FTYDVAGGGT ATNPEICFGI GGGCGGGVPE VGNNVNVPQE TLIRKYQFRD DFTWVHRNHN MKMGVNWDYV DVMGGFFYFG ANGYQVIFQD DPKTILANPA AYPDGFSTRG AVGELTYNGG SGSTAQPPSH QLAFYFQDDW KVTNRFTLNA GVRWDANPLF LIPQLTNNFS STNRTVRVLR DVLAANLSDP AAQAGVQRAD YLAGNTSLAS KNTADWKEFQ PRIGFSWDPT GSGKSVIRGG YGIARDTIFQ NLTLFAVQET NPTIYNTIID YFPSQAPGSC PAGGTADPTD LCNFRFGIDP LPAPQAATTD LAPGAVGRMQ DPRLTDPWSQ QMSIGGERQF GNDYAFGADY YHVLGTHEPR VLNMNPKIGS TCDPAYGGDP TNPTCVNGAG TRLMDAAFSE APDSQIAGQN LGIGRLGAIY DYSTSNRSLY DGINFQLRKR MSHHFQFQAS YVLSWARSWG GRPTSSYSGS GVNVTPEQQF ASNEFNYSSF DERHRFTLSG VFQLPWGFEV APLVQAASAR PYDFIAGSDI NGDGRSTIDR ACVGSTPGNP IFTKGCTMLK PDTLRGDPFF QIDTRVAKAF KFNEHMTLQL IWEFYNIGNV NNFCNYYFNN ASQSNFGTPQ GYCGGQGGPA FTGPFRQQFG FRFEF
|
| |