Gene Information |
Locus tag | YpsIP31758_3982 |
Symbol | |
ID | 5386526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 4467164 |
End bp | 4469539 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640867013 |
Product | hypothetical protein |
Protein accession | YP_001402930 |
Protein GI | 153950179 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
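The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in Protein Length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon.

    Assumes the CDS is unspliced and includes its terminal stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
start_bp, end_bp = 4467164, 4469539

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2376, matches Gene Length
print(protein_length_aa(length))  # 791, matches Protein Length
```

Both computed values agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.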
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAC CACTGAGCCG CATTATTGCA AGCGAACTGC AGGCCCGGCC GGAGCAAGTT ATCTCCGCTA TCCGCCTGCT TGATGAAGGT AATACCGTGC CCTTTATTTC ACGGTATCGT AAGGAAGTTA CCGGCGGGTT AGATGATATC CAACTGCGTC AGTTGGAAAG CCGTCTGGGG TATCTGCGTG AATTAGAAGA TCGCCGCCAA ACCATTCTTA AATCAATTGA AGATCAAGGA AAACTCACCG ACCAGCTGGC CGGGGCGATC AACGCCACCC TAAGTAAGAC CGAGCTGGAA GATCTGTATC TTCCTTATAA ACCGAAGCGC CGCACTCGCG GACAAATTGC CATTGAAGCC GGGTTAGAAC CCCTGGCAGA GCGTTTATGG CAGGATCCAC AACAAGACCC TGAACACACC GCGCTGGCCT ATGTTGATGC CGATAAAGGC GTCGCTGATA CTAAAGCCGC ATTGGATGGT GCTCGCTATA TTTTGATGGA GCGGTTTGCC GAAGATGCCA CCCTGCTGGC GAAAGTGCGT CAGTATCTGT GGAAAAACGC CCATCTGGTG TCAAAAGTCG TGGAAGGTAA AGAGCAGGAA GGCGCTAAAT TCCGCGATTA CTTCGATCAC CACGAACCTA TCGCACAAGT CCCTTCTCAC CGCGCATTGG CCATGTTCCG TGGCCGCAAT GAGGGGGTAC TGCAACTGGC CTTGGATCCT GATCCGCAAT TTGACGAACC GCCGCGTGAA AGTCAGGGTG AACAGATCAT TATCAACCAT CTTGATCTGC GCTTGAATAA TGCGCCGGCA GACGGCTGGC GTAAGGCGGT GGTCAACTGG ACCTGGCGTA TCAAGGTGCT GTTGCATCTG GAAACCGAGC TGATGAGCAC CTTGCGTGAA CGGGCTGAAG ATGAGGCTAT CAATGTCTTT GCCCGTAATA TGCAAGATTT ACTGATGGCC GCGCCAGCAG GTATGCGCGC GACCATGGGC CTCGATCCCG GCCTGCGTAC TGGCGTGAAA GTCGCGGTGG TGGATGCAAC AGGCAAGCTG GTCGCTTTCG ATACCATCTA CCCACACACC GGCCAGGCAG CAAAAGCCGC CGCCGTTGTC GCCGCCCTTT GCATCAAACA CCAGGTTGAA CTGGTGGCTA TCGGTAACGG TACTGCCTCA CGGGAAACCG AGCGCTTCTT TGTGGAGCTA CAGCAACAGT ACCCGGCCGT CACCGCCCAA AAAGTCATTG TCAGTGAGGC CGGTGCCTCG GTCTATTCAG CCTCTGAATT GGCCTCGCAA GAGTTTCCTG ATCTGGATGT CTCCATCCGT GGCGCGGTTT CCATTGCTCG CCGTCTGCAA GATCCGTTGG CTGAACTGGT AAAAATCGAT CCGAAATCTA TCGGTGTTGG TCAGTATCAG CACGATGTCA GCCAAAGCCA ATTGGCGAAA AAGCTGGATG CGGTGGTGGA AGACTGCGTA AACGCCGTCG GCGTGGATTT AAACACGGCT TCGGTGCCGT TACTGACACG TGTTGCCGGT TTGACGCGCA TGATGGCACA GAACATTGTG AACTGGCGTG ATGAGAATGG CCGCTTCCGC AACCGTGAGC AATTACTGAA AGTCAGCCGC CTCGGGCCGA AAGCCTTCGA ACAGTGTGCA GGCTTCTTGC GTATTAACCA CGGCGATAAC CCCTTAGACG CCTCGACAGT TCACCCAGAA GCCTATCCGG TAGTTGAGCG TATTTTAGCG GCCACCGAGC AGGCGTTGCA GGACTTAATG GGCAATGCCA ATGCGCTGCG CAACCTTAAT 
GCTCGCGATT TTACTACTGA GCGTTTTGGC GTACCAACGG TAACCGATAT TCTGCGAGAG CTGGAAAAGC CAGGCCGTGA CCCGCGCCCT GAATTTAAAA CAGCCACCTT CGCGGAAGGG GTGGAAACAC TGAATGACCT GACACCGGGC ATGATCCTTG AAGGCGCGGT CACTAACGTG ACAAATTTTG GTGCTTTTGT GGATATCGGC GTTCATCAGG ATGGTTTGGT GCATATCTCT TCACTGGCCG ATAAGTTTGT CGATGATCCA CATAAAGTGG TGAAAGCCGG CGATATCGTC AAAGTCAAAG TGATGGAAGT GGATCTGCAA CGTAAGCGCA TCGCCCTGAC CATGCGCCTT GATGAGCAGC CAGGTGAAAC TCACTCCCGC CGATCCAACA ATGGCACGGG TAGCGAGCGC ACCAATAATG ACAACCGCGG GGTAAATCGC CCACATAACG ACGCGAAAGG TCATAATGCG CCTAACCGTG CTCCGGCCAA AGGGCGATCA GATAGCAGCT CGGCGGGTAA CAGCGCCATG AGCGATGCGC TGGCGTCGGC CTTTAAAAAG CGTTAG
|
Protein sequence | MNEPLSRIIA SELQARPEQV ISAIRLLDEG NTVPFISRYR KEVTGGLDDI QLRQLESRLG YLRELEDRRQ TILKSIEDQG KLTDQLAGAI NATLSKTELE DLYLPYKPKR RTRGQIAIEA GLEPLAERLW QDPQQDPEHT ALAYVDADKG VADTKAALDG ARYILMERFA EDATLLAKVR QYLWKNAHLV SKVVEGKEQE GAKFRDYFDH HEPIAQVPSH RALAMFRGRN EGVLQLALDP DPQFDEPPRE SQGEQIIINH LDLRLNNAPA DGWRKAVVNW TWRIKVLLHL ETELMSTLRE RAEDEAINVF ARNMQDLLMA APAGMRATMG LDPGLRTGVK VAVVDATGKL VAFDTIYPHT GQAAKAAAVV AALCIKHQVE LVAIGNGTAS RETERFFVEL QQQYPAVTAQ KVIVSEAGAS VYSASELASQ EFPDLDVSIR GAVSIARRLQ DPLAELVKID PKSIGVGQYQ HDVSQSQLAK KLDAVVEDCV NAVGVDLNTA SVPLLTRVAG LTRMMAQNIV NWRDENGRFR NREQLLKVSR LGPKAFEQCA GFLRINHGDN PLDASTVHPE AYPVVERILA ATEQALQDLM GNANALRNLN ARDFTTERFG VPTVTDILRE LEKPGRDPRP EFKTATFAEG VETLNDLTPG MILEGAVTNV TNFGAFVDIG VHQDGLVHIS SLADKFVDDP HKVVKAGDIV KVKVMEVDLQ RKRIALTMRL DEQPGETHSR RSNNGTGSER TNNDNRGVNR PHNDAKGHNA PNRAPAKGRS DSSSAGNSAM SDALASAFKK R
|
| |