Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1502 |
Symbol | |
ID | 5137217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1614749 |
End bp | 1615918 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640532960 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_001217445 |
Protein GI | 147675765 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000018122 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGAAA TACTGTTCCT GTTATTGCCC ATCGCTGCCG CCTACGGCTG GTACATGGGG CATCGTAGCG CCCAGCAAGA TAAACAGAAA CAATCACACC AAATTTCGCG TCAATACGTG ACCGGTTTGA ACCTCTTACT CTCCGATCAA TCGGATAAAG CGGTTGATCA CTTTATCGAA TTACTGCAAG TCGATAATGA AACCATTGAT ACCCATCTAG CACTCGGTAA CCTTTTCCGG TCTCGTGGCG AAGTGGATCG CGCTATTCGT ATTCACCAAA ACCTTATCTC TCGCTCTGGG TTGACCATAG ACCAGAAAAA CTTGGCGCTG CAGCAGTTAG CTAAGGACTA TATGGTGTCC GGTTTTCTTG ATCGTGCTGA GAAAATATTT GAACAACTGA TTGATGAGCC TGAGCATCGT GAATCTGCTC TCCAGCAGCT CACGGCAATT TATCAACAGA CTCGTGAGTG GCACAAAGCG ATTGAGTGTG CCACCGCTTT GGTTAAGCTT GGGCGCAAAC GTATGAAGGT GAACATTGCG CATTTCTACT GCGAGCTGGC CATGCTTGAA AAGGCGGACA GTAACGATAA CAAAGCCATT CAACTGTTTA AAAAAGCATT ACAAGAAGAT CCGAAATGTG TACGTGCAAC CATTTCACTC GGTAAACTCT ACCTACAAAA CGAAGACTAT CAGAAGACTA TTGACCATCT AGAGATGGTT CTTGAGCAAG ATATCGATTT TATCGGTGAA GTGCTCAATA CGTTGGCTGA GTGTTACCAC CATTTGGGGC GAGAGCAAGA TTTGATCACC TTCTTACGTC GCTGTATCGC CAATAAAGCG GGGGTATCGG CCGAATTAAT GCTCGCTCAG CTGGTTGCAC AGCATGAAGG TATTGCCGCT GCACAAGAGA TTTTAACTCG TCAGTTGGTT AAAAATCCTA CTATGAAAGG CTTTTACCGG TTGATTGATT ACCATATTGC CGAAGCGGAA GAGGGCAGAG CCAAAGCCAG TCTCTCTACG TTACAACGCT TAGTGGGTGA ACAGCTTAAA GTAAAACCCC ATTACCGTTG CCGTAAATGT GGCTTTTCGA CCCATTCACT TTATTGGCAT TGCCCATCAT GTAAAAACTG GGGTTCAATC AAGCCAATCA GAGGCTTAGA TGGTGAGTAA
|
Protein sequence | MLEILFLLLP IAAAYGWYMG HRSAQQDKQK QSHQISRQYV TGLNLLLSDQ SDKAVDHFIE LLQVDNETID THLALGNLFR SRGEVDRAIR IHQNLISRSG LTIDQKNLAL QQLAKDYMVS GFLDRAEKIF EQLIDEPEHR ESALQQLTAI YQQTREWHKA IECATALVKL GRKRMKVNIA HFYCELAMLE KADSNDNKAI QLFKKALQED PKCVRATISL GKLYLQNEDY QKTIDHLEMV LEQDIDFIGE VLNTLAECYH HLGREQDLIT FLRRCIANKA GVSAELMLAQ LVAQHEGIAA AQEILTRQLV KNPTMKGFYR LIDYHIAEAE EGRAKASLST LQRLVGEQLK VKPHYRCRKC GFSTHSLYWH CPSCKNWGSI KPIRGLDGE
|
| |