Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2471 |
Symbol | |
ID | 5137232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2622138 |
End bp | 2623253 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640533922 |
Product | Smf/DprA family protein |
Protein accession | YP_001218364 |
Protein GI | 147675247 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATC AGGATTTAGC GGCATGGTTG GCGCTCTGTT TTACTCCTAA ACTAGGCAGC AAAACCATTT CTCACCTGCT TGCGACCCGT TTGCCAGCCC AGTTGCAAAG TTTTACGCCT AAGCAATGGT TGGCCTGCGG GCTTAAGCCC GAACAACTGG TGTTTTTAAC CACTCAAGCG GCTAAACAAG CCGAGCAGTG TTTGCAATGG CGATCAGCAG CCAATAACCG CTATATCGTC ACTCCTCATT GCCCGCTTTA CCCTCGTTTA TTGAAAGAGA TTAACTCATC GCCTCCCGTC CTGTTTATTG AAGGAATATG GGAAGCGGTG CATGACCCTG CGGTGGCTAT CGTCGGTAGC CGCAATGCCA GTGTTGATGG GCGGCAGATC GCTCGCCAGT TTGCCACTGA GCTCGCGCAG TCAGGTTTAG TGGTCACCAG TGGTTTAGCG CTTGGTATTG ATGGCTATGC GCACGATGGC GCTTTGCAAG CACAAGGGCA AACCGTAGCA GTATTAGGTT CAGGGCTGGC GCAGGTTTAC CCCAAACAGC ATCAAGGGTT AGCGGAGCGA ATCATCGCCC AAGGGGCCTT GGTTTCTGAG TTTGCCCCTC ACACACCGCC TAAAGCCGAT CACTTTCCGC GCCGTAACCG AATTATCAGC GGCTTATCGC TCGGTGTTGT GGTGGTAGAA GCTGCGGAGA AAAGCGGCTC ACTCATCACT GCACGCTACG CGGCTGAGCA AGGGCGTGAG GTCTTTGTGG TTCCCGGATC AATTTTTAAT GCCGCCAGCC AAGGTAGCAA CCAATTGATT CGCCAAGGCG CTTGTTTGGT GCAAAGTGTG CAACAAATTC ATCAAGAGCT CAAAAATGCG CTGACTTGGT CACTCTCTGA ACAAGTTCCT TATCAAGCAA CACTTTTTTC TGCTGTACAG AGCGATGAAG AATTGCCATT TCCCGAGCTG TTAGCTAACG TAGGAATAGA AGCTACACCT ATTGATATTC TCGCAAGCCG GACCCAGATA CCGGTGCAAG ATATCATGAT GCAGCTCTTG GAGCTTGAGC TCCTTGGGCA TGTGGTTGCA GTACCTGGTG GCTATATTAG AAAGGGGAGA GGCTAG
|
Protein sequence | MKDQDLAAWL ALCFTPKLGS KTISHLLATR LPAQLQSFTP KQWLACGLKP EQLVFLTTQA AKQAEQCLQW RSAANNRYIV TPHCPLYPRL LKEINSSPPV LFIEGIWEAV HDPAVAIVGS RNASVDGRQI ARQFATELAQ SGLVVTSGLA LGIDGYAHDG ALQAQGQTVA VLGSGLAQVY PKQHQGLAER IIAQGALVSE FAPHTPPKAD HFPRRNRIIS GLSLGVVVVE AAEKSGSLIT ARYAAEQGRE VFVVPGSIFN AASQGSNQLI RQGACLVQSV QQIHQELKNA LTWSLSEQVP YQATLFSAVQ SDEELPFPEL LANVGIEATP IDILASRTQI PVQDIMMQLL ELELLGHVVA VPGGYIRKGR G
|
| |