Gene Information |
Locus tag | YpAngola_A2983 |
Symbol | |
ID | 5801455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 3145370 |
End bp | 3147718 |
Gene Length | 2349 bp |
Protein Length | 782 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641340824 |
Product | Rhs element Vgr protein |
Protein accession | YP_001607354 |
Protein GI | 162420635 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria; [COG4253] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein; [TIGR03361] type VI secretion system Vgr family protein |
|
|
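The coordinate and length fields in the gene information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, though both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the gene information table.
START_BP = 3145370
END_BP = 3147718
GENE_LENGTH_BP = 2349
PROTEIN_LENGTH_AA = 782


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The same two checks are a quick way to spot off-by-one errors when coordinates are moved between 0-based and 1-based systems.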
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.389792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.000144528 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATACCT TTCTCCCGAC TATTCGGTTT GACCATAGCC ACCACAAACT GGTGGTCCGC GACAGCACCG CCGCCGTTGA TGTGCTGGGC TTTGAAGGCC ATGAAAGCCT GAGCCAGCCG TTCTGTTACG ACATCCAATT CACCAGCGCC GACAAGGCGA TTGGCCCGGC GACCATGCTG ATGCACGACG CGTCACTGAC ATTGGCGGCC CCGATGGCCG AAGCTTTTGG CGTGACGGTT CAGCAGACCC AGCGGGTGAT CCAAGGGGTG GTCACCGGAT TTAAACGCCT GTCGGCCTCG AAAGAGGAAT GCCGCTACGA ACTGAGCCTG CAACCGCGCC TGGCCTTGTT ATCGCGCAGC CATCAGAACG GGATATATCA GGACATGTCG GTGCCGCAGA TTGTGGAAAA AATTCTGCGT GAGCGCCATG ACATGCGCGG TCAGGATTTT GTCTTCACGC TGGCCCGCGA GTACCCACGC CGCGAGCAGG TAATGCAATA CGGCGAGGAT GACCTGACCT TTATCCGCCG CCTGCTGGCC GAGGTGGGGA TCTGGTTTCG TTTTACCGCC GATCCCAAAC TCAATATTGA TGTGGTGGAG TTTTACGACG ACCAGCGCTT CTATCAGCAA GGACTGACGC TACAGGCGGT GCCGCCTTCG GGGATGCACG ACAGCGGGAT GGAATCGGTC TGGGACCTGT CCAGCGCCCA TCAGGTGGTG GAAAAATCGG TCAGTACCGG CGATTACAAC TACCGCACCG CCACTGCCGA CCTGACCGCC GGGGCCGACA TCACGCGCGG CGATACCACC ACCTACGGCG AAGCCTATCA TTACGCCGAT AACTATCTGA CCGCAGGCAG CGAAGGGCGC GAGCCGGAAA GTGAAAGTGG GGCCTTTTAT GCCCGTCTGC GCCATGAACG TTACCTCAAT AATCAGGCGC GCTTCGCCGG GGTGGCCAAT GCGGCGGCAC TGGCACCGGG TCAGGAACTG AACGTCACCG GCAACGACGT GCCAGCACAG TTTGGTAAAG GGGTGATAAT CACCCGCATC ACCAGCCATG CCCGCCGCGA CCGCAGCTAT GAAGTACATT TTGAAGCCAT TCCTTACTCC GAGGATTATT GCTTCCGTCC GGCGCTGATC CGCAAGCCGA CCATGGCCGG GACCTTGCCG GCGCGGGTGA CCAGTACCAC TGCGAACGAC ACTTATGGTC ATATCGACAA AGACGGGCGC TACCGTGTCA ACCTGATGTT CGACCGTGAC AGTTGGGAGT CGGGTTACGA AAGCTTGTGG GTCCGTCAGG CCCGCCCGTA TGCGGGTGAC AGCTACGGCC TGCACCTGCC GCTGCTGGCA GGCACCGAAG TGGCGATCGC GTTTGAAGAC GGCAACCCGG ACCGGCCGTA TATCGCCTAT GTGCTACACG ACTCGGCGCA CGGCGACCAT GTCACCATCA GCAACTACAA ACGCAACGTA CTGCGTACTC CGTCGAATAA CAAACTGCGC CTGGAGGATG AACGGGGTAA AGAGCACATC AAGCTCAGCA CCGAATATGG CGGCAAAAGC CAACTGAATC TGGGGCATTT GGTTGATAAC GAGAAACAGC CTCGGGGTGA GGGTTTTGAG CTGCGCACCG ACAGTTTTGG TGTATTACGG GCAGAAAAAG GCCTCTTTAT CACTGCCGAC GGACAGGCCA AAGCACAGGG GCAGGTGCTG GAGATGCAAC CGGCTATCAG CCTGTTGAAA AGTGCGCAGG AACAGATGGA GGCCATCTCC GCGGATGCAC AAACCGCCAC CGCCAGTCCG 
GCCGACCTAC AGGTACAAAT CAGTCTGTTG CAGCAAAATC TCACTGAACT GAAACAAGCT GTCCTGTTAC TGAGTGCGCC AAAGGGGATC GCGTTGAGTA GCGGGGAGCA TCTGCAAATG AGTGCCAGCG ATAACCTGAT TGCCACCGCC GGTAAAAATG CCGACGTCAG CGTCGCGAAA AATTTCTTTA TCGGCGTAGG CAATACCCTA AGTATCTTCG TCAGAAAGCT GGGGATGAAA CTAATAGCCA ATCAGGGGCC GATAACAGTC CAGGCGCAGA ATGATCTAAT GGAGTTATTG GCGCGTAAAG CGATCACCAT TACCAGCACC GAGGATGAGA TAAAAATCAC CGCCAAGAAG AGGATCACGC TGAATGCGGG AGGCAGTTAT ATCACGCTGG ATGAGAATCG GATCGAGTCA GGAACGGCGA GGGAATATTT GACTAAGGCA GGGCATTACG GGCGAGTGGA TAAGGCGAAA TTGGAGACAG TAGTGCCAAC ATTAGCCGTT AAAGCTAAAC CACCCACTCA GAAGTATCCA TTTTCTTAA
|
Protein sequence | MNTFLPTIRF DHSHHKLVVR DSTAAVDVLG FEGHESLSQP FCYDIQFTSA DKAIGPATML MHDASLTLAA PMAEAFGVTV QQTQRVIQGV VTGFKRLSAS KEECRYELSL QPRLALLSRS HQNGIYQDMS VPQIVEKILR ERHDMRGQDF VFTLAREYPR REQVMQYGED DLTFIRRLLA EVGIWFRFTA DPKLNIDVVE FYDDQRFYQQ GLTLQAVPPS GMHDSGMESV WDLSSAHQVV EKSVSTGDYN YRTATADLTA GADITRGDTT TYGEAYHYAD NYLTAGSEGR EPESESGAFY ARLRHERYLN NQARFAGVAN AAALAPGQEL NVTGNDVPAQ FGKGVIITRI TSHARRDRSY EVHFEAIPYS EDYCFRPALI RKPTMAGTLP ARVTSTTAND TYGHIDKDGR YRVNLMFDRD SWESGYESLW VRQARPYAGD SYGLHLPLLA GTEVAIAFED GNPDRPYIAY VLHDSAHGDH VTISNYKRNV LRTPSNNKLR LEDERGKEHI KLSTEYGGKS QLNLGHLVDN EKQPRGEGFE LRTDSFGVLR AEKGLFITAD GQAKAQGQVL EMQPAISLLK SAQEQMEAIS ADAQTATASP ADLQVQISLL QQNLTELKQA VLLLSAPKGI ALSSGEHLQM SASDNLIATA GKNADVSVAK NFFIGVGNTL SIFVRKLGMK LIANQGPITV QAQNDLMELL ARKAITITST EDEIKITAKK RITLNAGGSY ITLDENRIES GTAREYLTKA GHYGRVDKAK LETVVPTLAV KAKPPTQKYP FS
|
| |
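The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be joined and spot-checked against the protein sequence follows. It uses only the first three and the last nucleotide blocks from the record, and the codon dictionary is deliberately partial: it covers only the codons in those excerpts and is not a full implementation of translation table 11. All function and variable names are illustrative.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# Partial codon table: only the codons present in the excerpts below.
PARTIAL_CODONS = {
    "ATG": "M", "AAT": "N", "ACC": "T", "TTT": "F", "CTC": "L",
    "CCG": "P", "ACT": "T", "ATT": "I", "CGG": "R", "TCT": "S",
    "TAA": "*",  # stop codon
}


def translate(seq: str) -> str:
    """Translate using the partial table; raises KeyError on an unknown codon."""
    return "".join(PARTIAL_CODONS[c] for c in codons(seq))


# First three blocks and the final block of the gene sequence, as printed above.
head = join_blocks("ATGAATACCT TTCTCCCGAC TATTCGGTTT")
tail = join_blocks("TTTTCTTAA")

print(translate(head))  # MNTFLPTIRF, the first block of the protein sequence
print(translate(tail))  # FS*, the last two residues followed by the stop
```

For the full 2349 bp sequence, a complete codon table from an established library would be the safer choice than extending the dictionary by hand.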