Gene Information |
Locus tag | VC0395_A0674 |
Symbol | |
ID | 5136094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 703074 |
End bp | 704633 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640532132 |
Product | hypothetical protein |
Protein accession | YP_001216624 |
Protein GI | 147673735 |
COG category | [S] Function unknown |
COG ID | [COG4383] Mu-like prophage protein gp29 |
TIGRFAM ID | |
|
|
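The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(703074, 704633)
print(length)                     # 1560
print(protein_length_aa(length))  # 519
```

Both values agree with the Gene Length (1560 bp) and Protein Length (519 aa) fields of this record.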
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATTC AATTTCTCGA CGCTCGCGGC CAGCCACTCA AAGCCGACAA AACCGTACTC GCCGAAGACA TTGCCCGTGC TTACACCACG GGCGTGCGCA ACCCACGCCC TGCCAGTGTG GCCTCAACCA TTACGCCGCA GCGCCTTGCA GGCTTGCTGC GTAGCGTAAT TGATGGCACA GACCCAGAAG CGTACATGAC GCTCGCGGAA GAAATGGAAG AGCGAGACCT GCACTACGCA GCGCAGTTGC GCACCCGTAA GCTCGCCGTG GCAGCGATTG AGCCAAGCGT GGAAGCCTAC AGCGATGAAG CCAATGATGT GCTGATGGCA GAGCGCGTGC GCGAAATCAT GACCGACGAC ATGATCCCTG AGCTGCTGTT TGATTTGCTC GATGGTTTAG GCAAGGGCCT TGCCGTAGTG CAAGTGCTGT GGGATACCAA GAAAACCCCG TGGAAGCCGA GCGATTATAA GTGGGTTGAC CCTCGTTACC TGCGCCAAGA CCAAGAAACC CTAGAGCAGA TCTTGCTGAT TAGTGATGAT GCCCCAACGG GCGCGCCGCT AGAGCCTTAT AAGTTCATCG TGCATACGCC GCGATCTAAG TCTGGCAGCG TGTGGCGCAA TGGCCTAGCG CGCTTAGTGG CAGTGATGTA CATGCTCAAA TCGTTCACCG TGCGCGATTG GTGGGCGTTT GCCGAAGTGT TCGGCATTCC GGTTCGGGTC GGTAAGTATG GCGCGAACGC TAGTGAGGGC GATATCAGCA CGCTGATTAA TGCCATTGGC CGCATCGCCA GTGATGCGGG TGCGGTGATC CCAGAGTCAA TGAAGATTGA CTTGCTAGAA ACGGCCAAAG GCAATGGCGG CGACACGCTA TTTGAAAACA TGGCGCGTTG GTGTGATGAA CAGATTTCAA AAGCCGTACT CGGCCAAACC ATGACCGCCG ACAATGGCAG CTCTCAATCG CAAGCAAACG TTCACAACGA AGTGCGGATT GATATTGCCA AGTGGGATGC GCGCCAACTC GAATCTTGCA TCAATGAATT CTTGGTTAAG CCTTACATCA TCCTCAACTG GGGTGTGCAA GAGCATTACC CGAAAGTGCG CATCAAAATC CCAGAGCCGG AAGATCTCAA AGTTCTGGTC GATAGCTTAA CGCCACTGAT CGACCGTGGC CTACGCGTGA GCGCTTCATC CGTGCGTGAT AAGTTCGGCC TGAGTGAGCC AGAGAACGAA GAAGAAGTGC TGGTGCCTAT GGCGCAAGCT TCCATGCAAT CTCTAGAGGT TGGCCTAAAC CATTCGCAAG GTATTGCGAT CAACCGCATC AGCCAAAGCG TAGACGCGGA GGTTGATGCG ATGACCGATG AAGCCGTTAG CGAATGGGTG GAAACTGGCG AAGAGTTTAT GAACCCGATC TTAAAGCTCG CCAAAGACTC GGCCAGTTAT GATGCGTTCT TGGCTGGCCT GCCTGCCTTG CAAGCGGAAC TCAGCGAGGG TGAGTTTGTT GAACAGATGG CGAAGCTGAT GTTTCAGGCT CGCGGTTTAG GAGATGCGCG CGATGCCTAA
|
Protein sequence | MSIQFLDARG QPLKADKTVL AEDIARAYTT GVRNPRPASV ASTITPQRLA GLLRSVIDGT DPEAYMTLAE EMEERDLHYA AQLRTRKLAV AAIEPSVEAY SDEANDVLMA ERVREIMTDD MIPELLFDLL DGLGKGLAVV QVLWDTKKTP WKPSDYKWVD PRYLRQDQET LEQILLISDD APTGAPLEPY KFIVHTPRSK SGSVWRNGLA RLVAVMYMLK SFTVRDWWAF AEVFGIPVRV GKYGANASEG DISTLINAIG RIASDAGAVI PESMKIDLLE TAKGNGGDTL FENMARWCDE QISKAVLGQT MTADNGSSQS QANVHNEVRI DIAKWDARQL ESCINEFLVK PYIILNWGVQ EHYPKVRIKI PEPEDLKVLV DSLTPLIDRG LRVSASSVRD KFGLSEPENE EEVLVPMAQA SMQSLEVGLN HSQGIAINRI SQSVDAEVDA MTDEAVSEWV ETGEEFMNPI LKLAKDSASY DAFLAGLPAL QAELSEGEFV EQMAKLMFQA RGLGDARDA
|
| |
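The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline: it assumes the sequences are pasted as shown (space-separated blocks of 10 characters), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may serve as starts. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAGTATTC AATTTCTCGA CGCTCGCGGC"
print(translate(first_blocks))  # MSIQFLDARG
```

The output matches the first block of the listed protein sequence. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.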