Gene VC0395_A1474 details


Gene Information

Locus tag: VC0395_A1474
Symbol:
ID: 5137442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1581583
End bp: 1582803
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 52%
IMG OID: 640532932
Product: hypothetical protein
Protein accession: YP_001217417
Protein GI: 147675143
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID: [TIGR02212] lipoprotein releasing system, transmembrane protein, LolC/E family
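
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1581583
end_bp = 1582803

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1221
print(protein_length)  # 406
```

Both values agree with the Gene Length and Protein Length fields in the record.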


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGGAT TTATGTTTCA TCCGATTTCG GCTTTTATTG GTTTGCGCTA TTTGCGCGGC 
CGCTCAGGGG ATCGTTTTAG CCGCTTTGTC TCTTATATGT CGACGGCGGG GATCACGATC
GGTGTGATGT CGCTAGTTAC TGTTTTATCG GTGATGAACG GTTTTGAGGC GCAGCTTAAG
TCACGCATTC TTGGGGTGTT ACCTCAAGCG GTTGTCACCG AAGCCGCAGG CAAAACCACC
CTCAGCGCAA CGCCCCCAGA TTTTGTCACG GCATTATCGA CTCAGCGCCC ACCAGAGCCG
TTAGTGCGCA GTGACGCGGT GGTGCAAAGT GCCTCGCAAC TGGCTGCGGG TTTGTTGATT
GGTATCGAGC CACAACAAAA TGATCCGATT GAGCAACACC TGATTGCAGG TCGGGTAACG
GCTTTGCAAG CGGGCGAGTA TCAACTCTTT CTTGGGCATC TTCTTGCGCG CAGTTTGAAT
GTCACCGTGG GTGATAAAGT GCGTTTGATG GTGACAGAAG CGAGCCAATT TACCCCGTTG
GGCCGTTTGC CCAGTCAGCG AAACTTTACG GTAGCGGGAA TTTTTAATAC CGGTTCGGAT
GTGGATGGGC AACTCATGGT GACTCATCTG CGCGATGCGG CCAAGCTATT GCGCTATGAT
GCACAGACCA TTTCAGGTTG GCGGCTATTT TTTGATGACC CGTTTGTGGT CAGTCAGCTT
GCCGAACAGC CATTGCCGCA AGATTGGCAA TGGAGCGACT GGCGTGAGCA GCGCGGTGAA
CTGTTTCAAG CGGTACGTAT GGAAAAGAAC ATGATGGGGC TGATGCTCGG ACTTATCGTT
GGGGTGGCTG CGTTTAATAT TATTTCTGCG CTGATCATGG TAGTGATGGA GAAGCAGGCT
GAGGTGGCGA TTTTAAAAAC CCAAGGCATG CAGTCACAAC ATGTGCTGGC GATTTTCATG
GTACAAGGCG CCAGCAGCGG TGTGATTGGT GCGCTGGTTG GTGGTTTACT TGGCGTGCTG
TTGGCGGCCA ATTTAAATAG CCTGATGGAA GCCTTAGGTG TTGCCCTTTT TTCGGTGGGT
GGCAGTTTGC CAGTGGCGAT TGATCCGCTG CAGATTGTCC TCGTTATTGT TCTTGCGATT
GTATTAAGTC TGTTGGCAAC GCTGTTTCCA GCCTATCGCG CATCTTCTGT TCAACCCGCT
GAGGCGTTAC GTTATGAATA A
 
Protein sequence
MVGFMFHPIS AFIGLRYLRG RSGDRFSRFV SYMSTAGITI GVMSLVTVLS VMNGFEAQLK 
SRILGVLPQA VVTEAAGKTT LSATPPDFVT ALSTQRPPEP LVRSDAVVQS ASQLAAGLLI
GIEPQQNDPI EQHLIAGRVT ALQAGEYQLF LGHLLARSLN VTVGDKVRLM VTEASQFTPL
GRLPSQRNFT VAGIFNTGSD VDGQLMVTHL RDAAKLLRYD AQTISGWRLF FDDPFVVSQL
AEQPLPQDWQ WSDWREQRGE LFQAVRMEKN MMGLMLGLIV GVAAFNIISA LIMVVMEKQA
EVAILKTQGM QSQHVLAIFM VQGASSGVIG ALVGGLLGVL LAANLNSLME ALGVALFSVG
GSLPVAIDPL QIVLVIVLAI VLSLLATLFP AYRASSVQPA EALRYE
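
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it assumes the space-grouped layout shown above, uses the standard codon assignments (which translation table 11 shares, differing mainly in the permitted start codons), and uses helper names (`clean`, `translate`, `gc_fraction`) that are hypothetical. Only the first and last fragments of the gene sequence are translated here for brevity.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and final 21 bases of the gene sequence, as listed above.
first_block = "ATGGTTGGAT TTATGTTTCA TCCGATTTCG"
last_block = "GAGGCGTTAC GTTATGAATA A"

print(translate(first_block))  # MVGFMFHPIS
print(translate(last_block))   # EALRYE
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow comparison with the listed protein sequence and the GC content field.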