Gene VC0395_A1744 details

Gene Information

Locus tag: VC0395_A1744
Symbol:
ID: 5137607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1865371
End bp: 1866825
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 50%
IMG OID: 640533201
Product: hypothetical protein
Protein accession: YP_001217683
Protein GI: 147674253
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
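
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch in Python, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1865371
END_BP = 1866825
GENE_LENGTH_BP = 1455
PROTEIN_LENGTH_AA = 484


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming an unspliced CDS whose
    length includes exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1455
print(coding_codons(GENE_LENGTH_BP))   # 484
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.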


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTCT TCCCGACACG TACCCTGTTA TGTCTATGCA TTGCAGCGCC ATGTCTTCCT 
GCCATCGCAC AAAATGATCC TATCGAATTG CCGGATATTG GCACGGTAGC GGGCTCGACG
CTGACCATAG ATCAAGAACT GATTTATGGC GATGCTTATA TGCGTATGCT GCGCAATAAC
CAACCTGTGA TTAACGATCC TGTGCTCAAT GAGTACATTG ATAATTTAGG ACATCGTTTA
GTCGCCAGCG CCAACGATGT CAAAACGCCG TTCACCTTTT TTATGATCCG CGATCGTAAC
ATCAACGCTT TTGCCTTTTT TGGTGGTTAC GTCGCCTTGC ACTCTGGGTT GTTCCTTCAT
GCGCAAAGTG AAAGTGAGTT AGCGTCGGTC ATGGCGCATG AAATTGCTCA CGTCACCCAA
CGCCACCTTG CGCGCAGCAT GGAAGAACAA GCACGCCGCT CTCCCGCGAC AATCGCAGCG
CTCGCCGGTT CATTACTGCT GGCGATTGCC GCCCCAGAAG CAGGAATTGC GGCGATCAAC
GCCACCATGG CGGGCAGCAT CCAAGGCCAG ATTAACTACA CGCGTAGCAA TGAAAAAGAA
GCGGATCGAT TTGGTATCGC GACCTTAGCC AAAGCCGGAT TTGACGCCAA CGCCATGCCG
CAATTTTTCA CTCGCCTTGC TGATGAATAT CGCTACGCCA GTAAGCCGCC CCCTATGCTG
CTGACTCACC CACTACCAGA AGACCGGATT ACCGATAGCC GTGAGCGGGC CAGACAATAT
CCGCCACTCA AACTTGCTCC ACACTTGGAT TATCATTTGG CGCGCGCACG GATCATCGCT
CGTTATGCAG GCATTGATGC CGACGCAGCG TTGGATTGGT TTGCTCGCAG TGAGAAAAAA
ATCGACGCCA CCCTACAGCC GTCTATCCAG TACGGCAAAG CCTTGGTCTA TCTCGATCTC
AAACAGTTCG ATAAAGCAGA GCCACTGTTG ACCCAGCTAG TTAAAGAACA ACCGGACAAT
CATTTTTATC TCGATGCGAT CAGCGATTTG TATATTGAGC TCAAGCAAGC CGATAAAGCA
CAAAGCTTGT TAGAAAAGGC GCTCAAGCAG ACGCCAAATA ACTCAGTGTT GACCATTAAC
TATGCGAATG TGCTGCTTAA GCAAGATAAG TTCACCGATG CCATTCGAAT CTTGCAACGT
TACACCCATG ACAATCCTAA TGACATCAAT GGTTGGCAAC TACTGTCTGA AGCCAATAGC
CGTTTAGGCA ACAGTGCGGA AGACTTAGCG GCACGCGGTG AAATCATGGC GCTGCAAGCA
AACTGGAACA AAGCGATTCA GTTTTATACC CAAGCCAGTC AGTTGGTGGA ATTGGGTAGC
TTGGCGCAAG CCCGTTACGA TGCGCGGATT GACCAGTTAA TGGTGCAACG CGAGCGCTTT
TTATCCCTCC AATAA
 
Protein sequence
MKFFPTRTLL CLCIAAPCLP AIAQNDPIEL PDIGTVAGST LTIDQELIYG DAYMRMLRNN 
QPVINDPVLN EYIDNLGHRL VASANDVKTP FTFFMIRDRN INAFAFFGGY VALHSGLFLH
AQSESELASV MAHEIAHVTQ RHLARSMEEQ ARRSPATIAA LAGSLLLAIA APEAGIAAIN
ATMAGSIQGQ INYTRSNEKE ADRFGIATLA KAGFDANAMP QFFTRLADEY RYASKPPPML
LTHPLPEDRI TDSRERARQY PPLKLAPHLD YHLARARIIA RYAGIDADAA LDWFARSEKK
IDATLQPSIQ YGKALVYLDL KQFDKAEPLL TQLVKEQPDN HFYLDAISDL YIELKQADKA
QSLLEKALKQ TPNNSVLTIN YANVLLKQDK FTDAIRILQR YTHDNPNDIN GWQLLSEANS
RLGNSAEDLA ARGEIMALQA NWNKAIQFYT QASQLVELGS LAQARYDARI DQLMVQRERF
LSLQ
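
The protein sequence and GC content can be re-derived from the gene sequence. The following Python sketch is an illustration under stated assumptions, not the method the database used. It strips the spacing between 10-base groups, looks codons up in the standard codon assignments (which translation table 11 shares), and stops at the first stop codon. Table 11 also allows alternative start codons to be read as methionine; that case is ignored here because this gene begins with ATG. The function names are hypothetical.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.
    Simplification: alternative start codons are not converted to M."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAATTCT TCCCGACACG TACCCTGTTA TGTCTATGCA TTGCAGCGCC ATGTCTTCCT"
print(translate(first_line))  # MKFFPTRTLLCLCIAAPCLP
```

The printed residues match the first two 10-residue groups of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.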