Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3840 |
Symbol | |
ID | 4070991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4543883 |
End bp | 4547251 |
Gene Length | 3369 bp |
Protein Length | 1122 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985863 |
Product | hypothetical protein |
Protein accession | YP_592914 |
Protein GI | 94970866 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAGA TGAGGCAAGC GATAGCGAGG GCGCGCATAG GCAGTCCTTG GAAGAGTGTG AGCACGATCT TGATCATGCT CTGCCTGACG GTCGGAGCGA TGGCACAGAG GATCACCGGC ACATTGCGCG GTCAAGTAAC CGATTCCGCC GGCAGCGTGG TGGTTGGTGC GAAGGTCACC GCCGCAAACC AAGACAGCGG GGTCACCGAG AAAACGGCAA CCAACTCGGC CGGCACCTAC ATATTTCCGG AACTGTTGCC GGGACCGTAT ACCGTGACGG TGCAATCGGA GGGCTTCGCC ACGAGCGCGG TCCGCGACGT GCGGGTCGCG ACCAATGTGG TCAACGACCG GAACGTCTCA TTGGCAGTCG GCGGGTCGAC GACCACCATT GATGTAAACG CGGCTGCCGA GACGGTGGAT CTCTCTTCGT CGACGGTTGC CACTACGTTT GATACGCGAG AAACTCTCGA TATTCCCAGC GGATCCAACA GTCCGCTGCA ATTGGCATTG TTTTCGGCGA ACACTACGGC GCAACAAGGC GGCGTAACCG GCACTGGCGG ATCCGTTTCC GGTACACGTC CACGGTCCAA CTCCTTCAAC ATTGACGGCG TGGACGATAA CAACGCGGGC ACTTCCGGCC AGATCTCGAA CGTCATTCAG GATGCAGTCG CCGAATTCAA CCTGGTGACG AATCCGTTTT CCGCGGAGTA CGGGCACGCG GGCGGCGGGC AATTCAACAT CGTCACCAAG ACAGGCACCA ACAGTTGGCA CGGGTCAGGG GAGTACTACC TGCAGAACCG CTTCCTGAAT GCGCTCGACA ATTTGACGAA AGATGCAATC GCGCAAGGCG CCATCGACCA CACTCCCAGG CTTGACGTCA GCCGAGTAGG TGGGACGATA GGTGGCCCGA TCATCAAGAA CCGCTGGTTC ATCTTCGGCG CCTATGAATA CTTCGACCAG CGCGCGGACA GCCTCGGAGC CGACATAGAA ACGCCTACGG CAGCAGGCAT CAGCACTTTG CAGTCGCTGG CGGCAACTCC GTACATTGCG AACCTGATTG GCACGCTGCC TGCGGCGCAA ACCGCAAACT CCGCGCCGCT TCTGGTGAAC GGCGTCAGCA TTCCCACCGG CTTGGTTCCC CAGGTGGCGC CCAATCCGTT TAAAGAACAT GATTTTCAGA TCAATAGCGA CCTAAAAGAA GGTCGCCATG AACTGAGCGC GCGCTTCCTG TTCAACAAGC AAGACATCAT AACCGCCGGT GCGATCAGTA CGCCTGAATA CAATCTGCCT ACCACCCTCA CAAATTACAA AGCGGCCTTG ATTGACACCT GGTCGATCTC CAACACGCTG GTCAACGACC TGCGGATCTC GTACTCCCAT TCTCTGCAGA GCCTCGCGGT TCCAGAGCCC TTCAGCAACA ACCCACTCAT TTTCCTGGCC GACATGAATG GCATCACCTT CGGGGCCAAT GATCCGCAGA GGATCTTGCA GGATGTGTAC CAGGTCATCG ATACGCAAAC CAAGATTTTC GGCCGGCACA CATTGAAGTA TGGCGGTGAA TATCGTCATT ACATCGCGCC GGCATTTTTC CTGCAAAGAT CCAACGGGAA CTACTTCTAT CTTTCGACAC AGACGTTTAT CAACGACGGC GTGCCGGAGA TCCAAATGCT CCGCGGCGCC GGAGACCCGG TGTTTCCGGA AACGCAGAGT GCCGTCGGGG CATTCCTGCA GGATGACTTC AAGGTGAGCA ATCGGCTGAC CCTGAACCTC GGTCTTCGAT ATGAGTTCAC CAACAATCCC TCTGGCGCAG AACGTCAGGC GAAGAACGCA ATTTCAAATG TTCCCGGAGT GATCGACTTT CACGCACCGA ACACCGCGAA GCTGGATTTC GAACCGCGAA TCGGATTTGC GTGGGACCCC ACCGGAACCG GCAAGACCTC GGTTCGTGGC GGAATCGGAT TGGGATATTC GCCTCCGGTT AACAACTTCA ACCAGAATGC CCAGCCGCCG CAGGTACAGA CGGTCCTCAA CCTGGGAACA GCGTGCTTCG GCGGCCTGAC CACTGCTCCT GCATGGTGCG CTTCAGGAGA CCACTTCTTT GCGCAGGGAG CACTGCCGTC AACCTATACG CTCGCGCCCG GTCCCGACAT ACCGCGGGCA CTCACGGCTT CCATCATGCC AGACACGATT GATGCTCGCA TCGTGAACTG GAGCCTCGGC ATTCAGCGCG AGGTTTACGC TGGTGGCGTT TTGGATGTGC GCTACGTTGG TTCGCGGAGT TTCCATCTGC CGACCCAGAT CCGACTGAAC AGCATTAGTG CTTTCGATGC CGGTCTGACT GCGTTGCCGA CCTACTTCAG CAACTCCGAA GTCCCGTCGG CCGTTCCGAA CCCTGCATCC ACGCAAGCGG ATTTCAAAAC CTTCCTTGCA AACCAGGGGT TCGCGCCGTA TTCACAGTAC GGATTCGTCA ACGTGCTGAC CGAGATCGGG CCGTTTGGTG CGAGCGTCTA CCACGGCGCT TCTGTGTCGT TCACACAAAG TCTGCGCCAC GGCTTGACGA TGCGTGCGAA TTACACCTGG TCGCATAACA TCGACAACGC AACGAACGAA CTGAACTCGA GCTCGGTGAA TCCGCGCCGT GCTGAAGATA GCTACGACCT CGATGCGGAA CGGGGAAATT CAGTGCTGGA TGTACGGCAT AAGTTCGCCG TAGCCTGGAC CTATCACACC CCGAACCTGA CAAGCGGTAG CCGTTTGCTG AGAGCGCTGG CGAACGGGTA CGAAATCAAC GGCGACTTCA TCGCCCAGAC CGGTCAGCCG GTTACGATTC TGTCTCCTTA CGATGCGAAT GCAAACGGCG ACACCGCTGG CGAGCGCGCG ATCTTCAATC CCCACGGTTC CCAGAACCTC TCTACCGATG TGAACTTCGT TTGTAACGAC GGAAGCGGTG GCGCGACCAG AATCGTGAAT CCGCAAGATG CCGCGGCAAC GCCGTGTTCG CCTTCGAACG TGGTGGGCTA CGTCAGCATG AATTCAGGCG CTGCTTTCGT TGCCAGTCAA CTTGGTGCGC GTTCGAATCT CGGCCGTGAC ACGGTGTACT CGCCGGGATT CGGAGTATGG AACGCGTCGC TCGGGAAGAG TTTCCGGGTG ACGGAAGGGA AGTCGTTCCT GGCGCGCGTA GAAGTTTACG ACGTATTCAA TCATCGCAAC TTTACGGTGG CCGGTCCGGT TACGGTGTTC GGAACCGCAA CTGGACTCCA AGCGTTCAAC CTTGGCTATG TGCAAACGGG GAGCAGCCAG TTCCTGGATT CCAAGCAGTT TACCGGCGGC GCCCGTGCTT TGCAGTTGGT CTTCAAGTTC ATCTTCTGA
|
Protein sequence | MKQMRQAIAR ARIGSPWKSV STILIMLCLT VGAMAQRITG TLRGQVTDSA GSVVVGAKVT AANQDSGVTE KTATNSAGTY IFPELLPGPY TVTVQSEGFA TSAVRDVRVA TNVVNDRNVS LAVGGSTTTI DVNAAAETVD LSSSTVATTF DTRETLDIPS GSNSPLQLAL FSANTTAQQG GVTGTGGSVS GTRPRSNSFN IDGVDDNNAG TSGQISNVIQ DAVAEFNLVT NPFSAEYGHA GGGQFNIVTK TGTNSWHGSG EYYLQNRFLN ALDNLTKDAI AQGAIDHTPR LDVSRVGGTI GGPIIKNRWF IFGAYEYFDQ RADSLGADIE TPTAAGISTL QSLAATPYIA NLIGTLPAAQ TANSAPLLVN GVSIPTGLVP QVAPNPFKEH DFQINSDLKE GRHELSARFL FNKQDIITAG AISTPEYNLP TTLTNYKAAL IDTWSISNTL VNDLRISYSH SLQSLAVPEP FSNNPLIFLA DMNGITFGAN DPQRILQDVY QVIDTQTKIF GRHTLKYGGE YRHYIAPAFF LQRSNGNYFY LSTQTFINDG VPEIQMLRGA GDPVFPETQS AVGAFLQDDF KVSNRLTLNL GLRYEFTNNP SGAERQAKNA ISNVPGVIDF HAPNTAKLDF EPRIGFAWDP TGTGKTSVRG GIGLGYSPPV NNFNQNAQPP QVQTVLNLGT ACFGGLTTAP AWCASGDHFF AQGALPSTYT LAPGPDIPRA LTASIMPDTI DARIVNWSLG IQREVYAGGV LDVRYVGSRS FHLPTQIRLN SISAFDAGLT ALPTYFSNSE VPSAVPNPAS TQADFKTFLA NQGFAPYSQY GFVNVLTEIG PFGASVYHGA SVSFTQSLRH GLTMRANYTW SHNIDNATNE LNSSSVNPRR AEDSYDLDAE RGNSVLDVRH KFAVAWTYHT PNLTSGSRLL RALANGYEIN GDFIAQTGQP VTILSPYDAN ANGDTAGERA IFNPHGSQNL STDVNFVCND GSGGATRIVN PQDAAATPCS PSNVVGYVSM NSGAAFVASQ LGARSNLGRD TVYSPGFGVW NASLGKSFRV TEGKSFLARV EVYDVFNHRN FTVAGPVTVF GTATGLQAFN LGYVQTGSSQ FLDSKQFTGG ARALQLVFKF IF
|
| |