Gene VC0395_A2817 details


Gene Information

Locus tag: VC0395_A2817
Symbol: mshI
ID: 5136897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2967382
End bp: 2968821
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 49%
IMG OID: 640534261
Product: MSHA biogenesis protein MshI
Protein accession: YP_001218667
Protein GI: 147674439
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3166] Tfp pilus assembly protein PilN
TIGRFAM ID: [TIGR01709] general secretion pathway protein L
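
The start, end, gene length, and protein length fields in this record agree with one another, and the check is simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends with a single stop codon; both are assumptions rather than something the record states. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2967382, 2968821)
print(length, protein_length(length))  # 1440 479
```

Both values match the Gene Length and Protein Length fields listed in the record.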


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAC CGAGTTGGAT AGAGAAACTG ATTGCCCCTA AGGTTGCCTC GCAACAGTTA 
TATGTTGTGG TGCAGCCAGA GCACCTGCAC TTCACATCCG ATGATTTATC GCCTATTCCT
CCCCAGCCGT TACAGCAACA AAGTTGGCAA GCGGTATTGG TGCAAACGCT GCAAAAGCAC
GCTGTCCATG ATGTACAAAT CCACCTTGTA CTGCATTCAC AGCTTTATCA AACTTATCAG
ATTGAACAGC CGAGTATTCC GCGTGAAGAG TGGTCGGCAG CCTTGCCTTT CTTGCTCAAA
GATATGTTGA GTGAGAAAGT GACGGATGTA GTGGCGGATG CTCACCCTCT TCCCGGCAGT
GGTAAAGTAC AAGCCTATGT GATCAGCAAG CGTACTATTC TTGAGTTGCA AAGCATGGCG
GTGTCGGCGG GATTAACGCT AGGACGAGTG ATCCCCGAGC AAGCGATTTG GGGATTGGTG
GGCGGAGAAT TGAGCCACTT CCTGTTGCTG CATCGCAGCA TGGGTGGCAG CTTTAAGCTG
GATGCTTTCG TTGATCGTCA GTGCAGTTTT CAACGCACTT TACGTGGGAT CACTGCGCCT
GTGACTGATA ACGCAGCCAG TGCCCTTCAG TTAGATAGCT TGGCGTTAGA GCTACAGCGC
TCGATTGATT ACTTATCCGC CCAGTTAAAG GGCGGCTCTT TACAACAGCT AAAAGTGTGT
TGTGATGGTG AAGATCAACA GGCTTTGATC ACCGGACTCA ATGAGCGCTT AAGTGTGCGA
GCGTCAGGAC TGGATGGTGA AGCGACCATC TGTGGTGAAC AACTGGCACG TTATGCGCGC
AATATCCCGC AAGAAACCAT CAATTTCTAT CAAGATCACC TCAAGCCGAA GCGTGAAAAG
TTCACCTTAA CCAATCTCTT GTTAGCGTGG TTGGCCTTGA GCGTTGTGTT ATTGCTTGGG
TACGCAGGGG TGGGTTATCA AAACTGGGTG ATCCAACAGC AGTGGCAAGA GCAGCAACAA
CATAATCAAT CGTTAACAGA ACAAGCGGCT CACTTACGTC AGCAGGTGGC GGTTCATCTT
CCTTCGCCCG CTAAACAGGC GGCGATAGGG CGCATAAAGC AAGAGATCTC TAGCAAACAG
CAAGCATTAG ACGCGATTGG GCAGTTTGAT GTGGCTCAGC AAACGGGCTA TTCCGGCGTA
TTGAACTCTT TGGCTCAATT GGCGCGTAGC GATATCTCTT TAAGCAGTAT TACTTTGGAT
TCCTCGCAAT TAAATGTGCA GGGACTCGCT CGTGATCCTG CCGCGATTCC AAACTGGATC
AGTCAATTTA AACAAGAACT GCATCTGATG GGCAGAAGCT TTGAGCAACT GAAAATTGGC
CGTAATGATC AAGACATGAT CACCTTTGAA CTCAACACTC AGCGAGGAGA ACAAAGATGA
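
The gene sequence is displayed in groups of ten bases, so the whitespace has to be stripped before computing statistics such as GC content. A minimal Python sketch follows; the helper names are hypothetical, and the example uses only the first displayed line of the sequence rather than the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the whole block to
# compare against the record-level GC content field.
first_line = "ATGAAGAAAC CGAGTTGGAT AGAGAAACTG ATTGCCCCTA AGGTTGCCTC GCAACAGTTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Because `str.split()` with no arguments splits on any run of whitespace, the same helper handles both the spaces between groups and the line breaks between rows.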
 
Protein sequence
MKKPSWIEKL IAPKVASQQL YVVVQPEHLH FTSDDLSPIP PQPLQQQSWQ AVLVQTLQKH 
AVHDVQIHLV LHSQLYQTYQ IEQPSIPREE WSAALPFLLK DMLSEKVTDV VADAHPLPGS
GKVQAYVISK RTILELQSMA VSAGLTLGRV IPEQAIWGLV GGELSHFLLL HRSMGGSFKL
DAFVDRQCSF QRTLRGITAP VTDNAASALQ LDSLALELQR SIDYLSAQLK GGSLQQLKVC
CDGEDQQALI TGLNERLSVR ASGLDGEATI CGEQLARYAR NIPQETINFY QDHLKPKREK
FTLTNLLLAW LALSVVLLLG YAGVGYQNWV IQQQWQEQQQ HNQSLTEQAA HLRQQVAVHL
PSPAKQAAIG RIKQEISSKQ QALDAIGQFD VAQQTGYSGV LNSLAQLARS DISLSSITLD
SSQLNVQGLA RDPAAIPNWI SQFKQELHLM GRSFEQLKIG RNDQDMITFE LNTQRGEQR
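
The protein sequence should correspond to a translation of the gene sequence under translation table 11. The sketch below shows one way to check this in plain Python without external libraries. It relies on the fact that table 11 uses the same amino-acid assignments as the standard code (the two differ in which start codons are permitted); the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, listed in
# TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a CDS (whitespace allowed) up to the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGAAAC CGAGTTGGAT AGAGAAACTG"))  # MKKPSWIEKL
```

This sketch translates every codon literally, so it does not apply the rule that an alternative start codon such as GTG or TTG is read as methionine at the first position. That does not matter for this gene, which starts with ATG.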