Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1744 |
Symbol | |
ID | 5137607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1865371 |
End bp | 1866825 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640533201 |
Product | hypothetical protein |
Protein accession | YP_001217683 |
Protein GI | 147674253 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTCT TCCCGACACG TACCCTGTTA TGTCTATGCA TTGCAGCGCC ATGTCTTCCT GCCATCGCAC AAAATGATCC TATCGAATTG CCGGATATTG GCACGGTAGC GGGCTCGACG CTGACCATAG ATCAAGAACT GATTTATGGC GATGCTTATA TGCGTATGCT GCGCAATAAC CAACCTGTGA TTAACGATCC TGTGCTCAAT GAGTACATTG ATAATTTAGG ACATCGTTTA GTCGCCAGCG CCAACGATGT CAAAACGCCG TTCACCTTTT TTATGATCCG CGATCGTAAC ATCAACGCTT TTGCCTTTTT TGGTGGTTAC GTCGCCTTGC ACTCTGGGTT GTTCCTTCAT GCGCAAAGTG AAAGTGAGTT AGCGTCGGTC ATGGCGCATG AAATTGCTCA CGTCACCCAA CGCCACCTTG CGCGCAGCAT GGAAGAACAA GCACGCCGCT CTCCCGCGAC AATCGCAGCG CTCGCCGGTT CATTACTGCT GGCGATTGCC GCCCCAGAAG CAGGAATTGC GGCGATCAAC GCCACCATGG CGGGCAGCAT CCAAGGCCAG ATTAACTACA CGCGTAGCAA TGAAAAAGAA GCGGATCGAT TTGGTATCGC GACCTTAGCC AAAGCCGGAT TTGACGCCAA CGCCATGCCG CAATTTTTCA CTCGCCTTGC TGATGAATAT CGCTACGCCA GTAAGCCGCC CCCTATGCTG CTGACTCACC CACTACCAGA AGACCGGATT ACCGATAGCC GTGAGCGGGC CAGACAATAT CCGCCACTCA AACTTGCTCC ACACTTGGAT TATCATTTGG CGCGCGCACG GATCATCGCT CGTTATGCAG GCATTGATGC CGACGCAGCG TTGGATTGGT TTGCTCGCAG TGAGAAAAAA ATCGACGCCA CCCTACAGCC GTCTATCCAG TACGGCAAAG CCTTGGTCTA TCTCGATCTC AAACAGTTCG ATAAAGCAGA GCCACTGTTG ACCCAGCTAG TTAAAGAACA ACCGGACAAT CATTTTTATC TCGATGCGAT CAGCGATTTG TATATTGAGC TCAAGCAAGC CGATAAAGCA CAAAGCTTGT TAGAAAAGGC GCTCAAGCAG ACGCCAAATA ACTCAGTGTT GACCATTAAC TATGCGAATG TGCTGCTTAA GCAAGATAAG TTCACCGATG CCATTCGAAT CTTGCAACGT TACACCCATG ACAATCCTAA TGACATCAAT GGTTGGCAAC TACTGTCTGA AGCCAATAGC CGTTTAGGCA ACAGTGCGGA AGACTTAGCG GCACGCGGTG AAATCATGGC GCTGCAAGCA AACTGGAACA AAGCGATTCA GTTTTATACC CAAGCCAGTC AGTTGGTGGA ATTGGGTAGC TTGGCGCAAG CCCGTTACGA TGCGCGGATT GACCAGTTAA TGGTGCAACG CGAGCGCTTT TTATCCCTCC AATAA
|
Protein sequence | MKFFPTRTLL CLCIAAPCLP AIAQNDPIEL PDIGTVAGST LTIDQELIYG DAYMRMLRNN QPVINDPVLN EYIDNLGHRL VASANDVKTP FTFFMIRDRN INAFAFFGGY VALHSGLFLH AQSESELASV MAHEIAHVTQ RHLARSMEEQ ARRSPATIAA LAGSLLLAIA APEAGIAAIN ATMAGSIQGQ INYTRSNEKE ADRFGIATLA KAGFDANAMP QFFTRLADEY RYASKPPPML LTHPLPEDRI TDSRERARQY PPLKLAPHLD YHLARARIIA RYAGIDADAA LDWFARSEKK IDATLQPSIQ YGKALVYLDL KQFDKAEPLL TQLVKEQPDN HFYLDAISDL YIELKQADKA QSLLEKALKQ TPNNSVLTIN YANVLLKQDK FTDAIRILQR YTHDNPNDIN GWQLLSEANS RLGNSAEDLA ARGEIMALQA NWNKAIQFYT QASQLVELGS LAQARYDARI DQLMVQRERF LSLQ
|
| |