Gene VEA_001372 details

Gene Information

Locus tag: VEA_001372
Symbol:
ID: 8559684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio sp. Ex25
Kingdom: Bacteria
Replicon accession: NC_013457
Strand:
Start bp: 1551766
End bp: 1554192
Gene Length: 2427 bp
Protein Length: 808 aa
Translation table: 11
GC content: 48%
IMG OID: 646409041
Product: microbial collagenase secreted
Protein accession: YP_003288520
Protein GI: 262396667
COG category:
COG ID:
TIGRFAM ID:
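
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1551766
end_bp = 1554192
gene_length_bp = 2427
protein_length_aa = 808

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS consists of whole codons and ends with one
# stop codon that does not contribute a residue to the protein.
codons = span // 3
expected_protein_aa = codons - 1

print("span (bp):", span)
print("codons:", codons)
print("expected protein length (aa):", expected_protein_aa)
print("consistent with record:",
      span == gene_length_bp and expected_protein_aa == protein_length_aa)
```

Under those assumptions the span, the gene length, and the protein length reported in the record agree with one another.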


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00032964
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCATA TCCGTTTTTT CCCGCGCCAT CGTTTGGCGC TTGCTTGTAT GCTAGCGAGT 
GTGTCTAGCT TCTCTTTTGC ACAAAACCAG TGCGCTGTTG CTGATCTACA ACAATCGAGA
GACTTGGCTG CTGCCGTTTC TGGTGCTGAG TATGATTGCT ACCACGCTTG GTTTTCAGCT
CCTTCCGCAA CGCTGAACGA TATTTACAGC GAAGCGAGCT TAAGCCGTAT CCAAGTCGCG
TTGGATCAAG AAATTGCCCG CTATCGTGGA GAAGCAGAGC AAGCTCGTGT TCTAGAAAAT
CTGGGCGAGT TTATTCGAGC GGCCTACTAC GTTCGTTACA ATGCTGGCAC AGCTACACCT
GAGTTTTCGC AAGCGTTGAG TCAGCGTTTT GCTCAGTCGA CGAACCTATT CCTAAACAAC
CCTCATGCTT TGGATCAAGG CCGTGAACAA GTTGGCGCAA TGAAGAGCTT GACTTTGATG
GTTGATAACG TGAAGCAGTT GCCGCTGACC ATGGATAGCA TGATGGCCGC GTTGATGCAC
TTTAACCGTG AAACCGCGCA GGACACGCAA TGGGTCGATG GTTTAAACAA CCTGTTCCGT
TCTATGGCAG GTCATGTAGC AAACGATGAG TTCTACCGCT ACATGGCAAA CAATACTCAC
CATATTGATA CATTAGCGAG ATTCGCATCT GATAACGCTT GGGCATTGGA TACGGATGCG
AGCTTTATTG TTTTCAATGC ATTACGTGAA ACTGGTCGTT TGCTTGCGAG CCCAGATCAG
GAGACCAAAC GCAAAGCCTT GGCTGTGATG CAACAAGTGA TGCAGCGTTA CCCACTGGGT
AGTGAGCATG ACAAACTGTG GCTTGCTGCC GTAGAGATGA TGAGCTACTA CGCACCTGAA
GGTTTAAACG GTCTGGACCT AGATCAAGCG AAGCAAGATC TAGCAGCGCG CGTGATGCCA
AATCGTTTTG AGTGTCAGGG ACCTGCCATC ATTCGCTCTG AAGATCTGAC CGATGCTCAA
GCTGCGAAAG CGTGTGAAGT ACTGGCTGCG AAAGAAGCCG ACTTCCATCA AGTGGCGAAT
ACGGGCAATC AACCTGTCGC TGATGACTTG AATGACCGTG TTGAGGTGGC GGTGTTTGCT
AGTAACGACA GCTACGTTGA TTATTCTTCG TTCCTGTTTG GTAATACAAC GGACAACGGT
GGTCAGTACC TTGAAGGGAA TCCATCAAAA GCGGACAATA CCGCCCGTTT CGTGGCTTAT
CGCTATGCGA ATGGCGAAGA TTTATCGATT TTGAATCTAG AGCATGAGTA CACGCACTAT
TTGGACGCGC GATTTAACCA GTACGGCTCA TTCAGTGATA ACTTAGCCCA CGGTCATATC
GTGTGGTGGC TGGAAGGTTT TGCTGAATAC ATGCATTACA AACAAGGCTA CCAAGCTGCG
ATCGAACTGA TTGCAGGCGG TAAATTGAGC CTTTCAACTG TGTTTGATAC CACATATTCT
CACGATACTA ATCGTATCTA TCGTTGGGGT TACTTGGCCG TTCGATTCAT GTTAGAAAAT
CATCCGCAAG ACGTAGAAAG CTTGTTGGCG TTGTCGCGCA CTGGTCAGTT TGAACAGTGG
GCGCAACAGG TGACGACATT AGGTCAACAA TATGACGCAG AGTTTGAACG CTGGTTGGAT
ACGTTAGAAG TGGAGCCAGA GCAACCAGTT ACTGACCCAG AAGAGCCAAT AGAACCAACA
GAACCTGAAG CACAAGTGAC CGAATTGACG GCGAACCAAA GCCTTCAACT CAGTGGCGAA
GCATACAGTG AAAAACTGTT CTACGTTGAC GTACCAGCCA ACACCGTTCG CTTTAATGTA
TCTATTGAAG GTGCGGGGGA TGCGGATCTT TACATGAGCT ACAACAAAGT TGCGCATTAC
TACGACTTTG AAATGAGCCA GTATGGCGAT GGCAGCAATG AAGAGATTGT ATTTGAGCCA
GAGCAGGATG GTTACGTGAA AGCAGGTCGT TACTACATCA GTTTAACTGG CCGTGATAAC
TATGACTCAG TGAATCTTGT GGCTGCTTTA GACTTTGAAG CGCAAACACC GCCAACACAA
GTGCAAGACG ATTTGGCACC GGTGGTATTA GAGTCTGGTG AAGCAAAAAT GCTGACCGTT
CACAAGCAGC GTTATGCGGC AGTGTATGTA CCTGAAGGTG TTGAAGAGGT TCGTGTATGG
CTAACGGCTC ACTCTCAAAC AGGTAATGTC GACCTATACG CGAGTAGGGA GCATTGGCCT
ACAGTTGAGC AACACGAGTA CGTGTCCAAC TATGCAGGCA GCAACGAATA CCTTGCGATT
CCTGTTACTG AAGCGGGCTA TGTGCACTTC TCTCTTCAAG CACCAGAACA AGGTGATGAC
GTTGAAATGT TAGTGCACTT CAACTAA
 
Protein sequence
MSHIRFFPRH RLALACMLAS VSSFSFAQNQ CAVADLQQSR DLAAAVSGAE YDCYHAWFSA 
PSATLNDIYS EASLSRIQVA LDQEIARYRG EAEQARVLEN LGEFIRAAYY VRYNAGTATP
EFSQALSQRF AQSTNLFLNN PHALDQGREQ VGAMKSLTLM VDNVKQLPLT MDSMMAALMH
FNRETAQDTQ WVDGLNNLFR SMAGHVANDE FYRYMANNTH HIDTLARFAS DNAWALDTDA
SFIVFNALRE TGRLLASPDQ ETKRKALAVM QQVMQRYPLG SEHDKLWLAA VEMMSYYAPE
GLNGLDLDQA KQDLAARVMP NRFECQGPAI IRSEDLTDAQ AAKACEVLAA KEADFHQVAN
TGNQPVADDL NDRVEVAVFA SNDSYVDYSS FLFGNTTDNG GQYLEGNPSK ADNTARFVAY
RYANGEDLSI LNLEHEYTHY LDARFNQYGS FSDNLAHGHI VWWLEGFAEY MHYKQGYQAA
IELIAGGKLS LSTVFDTTYS HDTNRIYRWG YLAVRFMLEN HPQDVESLLA LSRTGQFEQW
AQQVTTLGQQ YDAEFERWLD TLEVEPEQPV TDPEEPIEPT EPEAQVTELT ANQSLQLSGE
AYSEKLFYVD VPANTVRFNV SIEGAGDADL YMSYNKVAHY YDFEMSQYGD GSNEEIVFEP
EQDGYVKAGR YYISLTGRDN YDSVNLVAAL DFEAQTPPTQ VQDDLAPVVL ESGEAKMLTV
HKQRYAAVYV PEGVEEVRVW LTAHSQTGNV DLYASREHWP TVEQHEYVSN YAGSNEYLAI
PVTEAGYVHF SLQAPEQGDD VEMLVHFN
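
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons with translation table 11, the table named in the record. The following is a minimal sketch, not part of the original record. It builds the codon table from the standard TCAG ordering, in which table 11 assigns the same amino acids as the standard code, and translates only the first 30 and last 27 bases copied from the gene sequence above. The helper names `clean` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI "TCAG" order. Translation table 11 assigns the
# same amino acids to codons as the standard code; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 27 bases, copied from the gene sequence in this record.
first_bases = "ATGTCTCATA TCCGTTTTTT CCCGCGCCAT"
last_bases = "GTTGAAATGT TAGTGCACTT CAACTAA"

print(translate(first_bases))  # MSHIRFFPRH
print(translate(last_bases))   # VEMLVHFN*
```

The two outputs match the first ten and the last eight residues of the protein sequence, with the trailing `*` standing for the stop codon TAA.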