Gene VC0395_A0674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0674 
Symbol 
ID5136094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp703074 
End bp704633 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content55% 
IMG OID640532132 
Producthypothetical protein 
Protein accessionYP_001216624 
Protein GI147673735 
COG category[S] Function unknown 
COG ID[COG4383] Mu-like prophage protein gp29 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTC AATTTCTCGA CGCTCGCGGC CAGCCACTCA AAGCCGACAA AACCGTACTC 
GCCGAAGACA TTGCCCGTGC TTACACCACG GGCGTGCGCA ACCCACGCCC TGCCAGTGTG
GCCTCAACCA TTACGCCGCA GCGCCTTGCA GGCTTGCTGC GTAGCGTAAT TGATGGCACA
GACCCAGAAG CGTACATGAC GCTCGCGGAA GAAATGGAAG AGCGAGACCT GCACTACGCA
GCGCAGTTGC GCACCCGTAA GCTCGCCGTG GCAGCGATTG AGCCAAGCGT GGAAGCCTAC
AGCGATGAAG CCAATGATGT GCTGATGGCA GAGCGCGTGC GCGAAATCAT GACCGACGAC
ATGATCCCTG AGCTGCTGTT TGATTTGCTC GATGGTTTAG GCAAGGGCCT TGCCGTAGTG
CAAGTGCTGT GGGATACCAA GAAAACCCCG TGGAAGCCGA GCGATTATAA GTGGGTTGAC
CCTCGTTACC TGCGCCAAGA CCAAGAAACC CTAGAGCAGA TCTTGCTGAT TAGTGATGAT
GCCCCAACGG GCGCGCCGCT AGAGCCTTAT AAGTTCATCG TGCATACGCC GCGATCTAAG
TCTGGCAGCG TGTGGCGCAA TGGCCTAGCG CGCTTAGTGG CAGTGATGTA CATGCTCAAA
TCGTTCACCG TGCGCGATTG GTGGGCGTTT GCCGAAGTGT TCGGCATTCC GGTTCGGGTC
GGTAAGTATG GCGCGAACGC TAGTGAGGGC GATATCAGCA CGCTGATTAA TGCCATTGGC
CGCATCGCCA GTGATGCGGG TGCGGTGATC CCAGAGTCAA TGAAGATTGA CTTGCTAGAA
ACGGCCAAAG GCAATGGCGG CGACACGCTA TTTGAAAACA TGGCGCGTTG GTGTGATGAA
CAGATTTCAA AAGCCGTACT CGGCCAAACC ATGACCGCCG ACAATGGCAG CTCTCAATCG
CAAGCAAACG TTCACAACGA AGTGCGGATT GATATTGCCA AGTGGGATGC GCGCCAACTC
GAATCTTGCA TCAATGAATT CTTGGTTAAG CCTTACATCA TCCTCAACTG GGGTGTGCAA
GAGCATTACC CGAAAGTGCG CATCAAAATC CCAGAGCCGG AAGATCTCAA AGTTCTGGTC
GATAGCTTAA CGCCACTGAT CGACCGTGGC CTACGCGTGA GCGCTTCATC CGTGCGTGAT
AAGTTCGGCC TGAGTGAGCC AGAGAACGAA GAAGAAGTGC TGGTGCCTAT GGCGCAAGCT
TCCATGCAAT CTCTAGAGGT TGGCCTAAAC CATTCGCAAG GTATTGCGAT CAACCGCATC
AGCCAAAGCG TAGACGCGGA GGTTGATGCG ATGACCGATG AAGCCGTTAG CGAATGGGTG
GAAACTGGCG AAGAGTTTAT GAACCCGATC TTAAAGCTCG CCAAAGACTC GGCCAGTTAT
GATGCGTTCT TGGCTGGCCT GCCTGCCTTG CAAGCGGAAC TCAGCGAGGG TGAGTTTGTT
GAACAGATGG CGAAGCTGAT GTTTCAGGCT CGCGGTTTAG GAGATGCGCG CGATGCCTAA
 
Protein sequence
MSIQFLDARG QPLKADKTVL AEDIARAYTT GVRNPRPASV ASTITPQRLA GLLRSVIDGT 
DPEAYMTLAE EMEERDLHYA AQLRTRKLAV AAIEPSVEAY SDEANDVLMA ERVREIMTDD
MIPELLFDLL DGLGKGLAVV QVLWDTKKTP WKPSDYKWVD PRYLRQDQET LEQILLISDD
APTGAPLEPY KFIVHTPRSK SGSVWRNGLA RLVAVMYMLK SFTVRDWWAF AEVFGIPVRV
GKYGANASEG DISTLINAIG RIASDAGAVI PESMKIDLLE TAKGNGGDTL FENMARWCDE
QISKAVLGQT MTADNGSSQS QANVHNEVRI DIAKWDARQL ESCINEFLVK PYIILNWGVQ
EHYPKVRIKI PEPEDLKVLV DSLTPLIDRG LRVSASSVRD KFGLSEPENE EEVLVPMAQA
SMQSLEVGLN HSQGIAINRI SQSVDAEVDA MTDEAVSEWV ETGEEFMNPI LKLAKDSASY
DAFLAGLPAL QAELSEGEFV EQMAKLMFQA RGLGDARDA