Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2156 |
Symbol | |
ID | 4068793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2575459 |
End bp | 2577699 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984172 |
Product | amino acid transporters |
Protein accession | YP_591231 |
Protein GI | 94969183 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.739886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.876305 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACAA TTCCAAAGCC AGCCGTAAAG GTCTTCGTTG CCACTACGGT GATGCTGACC TTCATCTCTT TTTGGCGCGC GGCCGCCATT GTTCTCTCAG ACTTAGCGTC TTCCGCATAC TACGTCGGTG GCGACGCCGA AAAGGTCATC GGCAAGAGCG CACCCTGGTT CGTCCTAGCG GTCATGCTGT TCAGCTATGC GGTCCGCGCC ATTTATATCG AGTCGAGCTC GATGTTCGTG CGTGGAGGCG TCTACAAGGT CGTCAAAAGC GCCATGGGCG GTCAGCTTGC AAAGCTGTCG GTATCCGCGC TGCTCTTCGA CTACGTCCTG ACCGGCCCGA TTAGCGCTGT TTCTGCCGGA CAATATCTCG TCGGCTTCCT GCAGGACCTG TCCACGCGGC TTCATCATCC GATTACGTTA TCCGACGCCG GCGTGAATTA CTGTGCCGCC GTCTTCGGCT GTCTCGTGAC CGGTTATTTC TGGCACAAGA ACATCCAGGG CATTCACGAA AGCAGCGAGA AGGCGCTTCG CATCATGCAG ATCACCACGG TCATGGTGGT GATTTTGATC GGCTGGTGCC TGGCAACGCT CGTGATTCAT CCCAGTACGG CGCACCTTCC ACCTTTGCCA TCGGGCCACA ACCTGCTCAA GACGGACGAA TCTCTGGGTT GGCTGCACGG GACGATGTTC GCCAATCTGA CCTGGATTCT GTTGCTGGTG GGTTTCGGAC ACTCCGTGCT GGCAATGAGC GGCGAGGAAT CGCTCGCACA GGTGAACCGC GAGATCGAGC ACCCGAAACT CAAGAACCTG GAAAAAGCGG GCCTGGTCAT TTTCTTGTAC AGCCTGCTGT TCACCGGCCT GGTTTCGTTT TTCGCGGTGA TGATTATTCC CGACAATGTT CGCCCGAATT ATTTCGCGAA CTTGATCAGT GGATTGGCGA TGAACGTTGC CGGTCCCTAC AACGTAAAGC TTCTGTTCCA AGGCTTTGTC GTGATCGTGG GTGCACTGAT CCTGTCGGGA GCAGTCAACA CGGCCATCGT GGGCGCCAAC GGCGTTCTGA ACCGTCTCAG CGAAGACGGT GTAATGACGC CATGGTTCCG CAAGCCACAT CATCGGTTCG GGACAAGCAG CCGCATCATC AACCTGATTG CTGGCTTGCA GATCGCGACG ATTCTGGCGA GCCGCGGCAA TGTTTACCTG CTGGCTGCTC TGTATGCCTT CGGCGTGATC TGGTCCTTCT CGTTCATGTC GCTGGCGGTG TTCGTCCTGC GATTTACGAG TCCAGAGGGC CGCGAGTGGC GGGTGCCAGG CAACATTCGG ATCGGCGGGA AGGAAATTCC GGTCGGCGTT GGACTGATTG CGGTGTTGCT CTTCTCGATC GCAATCGTCA ATTTGTTCAC CAAGACGCTT GCGACCAAAT ACGGCATTGC CTTCAGCATC TTCCTGTACG TCGTGTTCAC GATTTCGGAG CGGCTTAACC AGAAGACCGT TGCTGGTGGC GGCCACGATC TCGAACAGTT CCGCGTGCAG GCGCAAGACG ACATCACCGC GGAAGCGATG GAGGTCCGAC CGGGGAACAT CCTGGTAGCG GTCCGCGACC CTCGCAATCT CTTCTACCTG CGCGAGGTGT TGAAGCACAC CGATACGACC AAGCAAGACG TCGTGGTCAT GACGTCACGC CTCTACCACC GCGAGTACTC GTTCAGCGGC AACACCAACC TCGACAGCTC AGAGGTCTTT GAGGAGTATG AGCGTCAGTT GTTCACCGCG GTCGTGAACG AGGCTGAAAA ACAGGGCCGC CACGTTTCAC TGCTGGTTGC GCCGACGAAC GATGTGTTTG AAAGCATCGT GGCCACTGGC GCGCGTTTGC ACTCGACCGT AATCGTCTGC GGGTTGTCGA ACAAATTGAC CCCCGAAGAG CAGGGCAAGC TGACCGGCGA CGCCTGGGAA CGCCTACCTG ATCCCAAGCC GCGCATGAGA TTGATTGTGG CTTCGGCGGA CGGACAGAAA TGGGAGTTCG AGCTTGGTCC GCACACCCCG CGCATGCGTC GCGAAGACTT GAAGCTCATG CACGACATCT GGCTCCAGGT CACCCGCGAT CCGGCATACA GCAAGCTGCA TCACTACCAC GTCATCGCCG TCGCGTTGAA AGAGCTGCAG CAGCGGCTGA ATGGCACCGA GAGCGCCGCG GCGCTTTCCG ATATTCGTGA CGAGATCGAG CGCAAGGACG AAGAGTTCTG A
|
Protein sequence | MSTIPKPAVK VFVATTVMLT FISFWRAAAI VLSDLASSAY YVGGDAEKVI GKSAPWFVLA VMLFSYAVRA IYIESSSMFV RGGVYKVVKS AMGGQLAKLS VSALLFDYVL TGPISAVSAG QYLVGFLQDL STRLHHPITL SDAGVNYCAA VFGCLVTGYF WHKNIQGIHE SSEKALRIMQ ITTVMVVILI GWCLATLVIH PSTAHLPPLP SGHNLLKTDE SLGWLHGTMF ANLTWILLLV GFGHSVLAMS GEESLAQVNR EIEHPKLKNL EKAGLVIFLY SLLFTGLVSF FAVMIIPDNV RPNYFANLIS GLAMNVAGPY NVKLLFQGFV VIVGALILSG AVNTAIVGAN GVLNRLSEDG VMTPWFRKPH HRFGTSSRII NLIAGLQIAT ILASRGNVYL LAALYAFGVI WSFSFMSLAV FVLRFTSPEG REWRVPGNIR IGGKEIPVGV GLIAVLLFSI AIVNLFTKTL ATKYGIAFSI FLYVVFTISE RLNQKTVAGG GHDLEQFRVQ AQDDITAEAM EVRPGNILVA VRDPRNLFYL REVLKHTDTT KQDVVVMTSR LYHREYSFSG NTNLDSSEVF EEYERQLFTA VVNEAEKQGR HVSLLVAPTN DVFESIVATG ARLHSTVIVC GLSNKLTPEE QGKLTGDAWE RLPDPKPRMR LIVASADGQK WEFELGPHTP RMRREDLKLM HDIWLQVTRD PAYSKLHHYH VIAVALKELQ QRLNGTESAA ALSDIRDEIE RKDEEF
|
| |