Gene YpAngola_A2983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2983 
Symbol 
ID5801455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3145370 
End bp3147718 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content56% 
IMG OID641340824 
ProductRhs element Vgr protein 
Protein accessionYP_001607354 
Protein GI162420635 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria
[COG4253] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.389792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.000144528 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATACCT TTCTCCCGAC TATTCGGTTT GACCATAGCC ACCACAAACT GGTGGTCCGC 
GACAGCACCG CCGCCGTTGA TGTGCTGGGC TTTGAAGGCC ATGAAAGCCT GAGCCAGCCG
TTCTGTTACG ACATCCAATT CACCAGCGCC GACAAGGCGA TTGGCCCGGC GACCATGCTG
ATGCACGACG CGTCACTGAC ATTGGCGGCC CCGATGGCCG AAGCTTTTGG CGTGACGGTT
CAGCAGACCC AGCGGGTGAT CCAAGGGGTG GTCACCGGAT TTAAACGCCT GTCGGCCTCG
AAAGAGGAAT GCCGCTACGA ACTGAGCCTG CAACCGCGCC TGGCCTTGTT ATCGCGCAGC
CATCAGAACG GGATATATCA GGACATGTCG GTGCCGCAGA TTGTGGAAAA AATTCTGCGT
GAGCGCCATG ACATGCGCGG TCAGGATTTT GTCTTCACGC TGGCCCGCGA GTACCCACGC
CGCGAGCAGG TAATGCAATA CGGCGAGGAT GACCTGACCT TTATCCGCCG CCTGCTGGCC
GAGGTGGGGA TCTGGTTTCG TTTTACCGCC GATCCCAAAC TCAATATTGA TGTGGTGGAG
TTTTACGACG ACCAGCGCTT CTATCAGCAA GGACTGACGC TACAGGCGGT GCCGCCTTCG
GGGATGCACG ACAGCGGGAT GGAATCGGTC TGGGACCTGT CCAGCGCCCA TCAGGTGGTG
GAAAAATCGG TCAGTACCGG CGATTACAAC TACCGCACCG CCACTGCCGA CCTGACCGCC
GGGGCCGACA TCACGCGCGG CGATACCACC ACCTACGGCG AAGCCTATCA TTACGCCGAT
AACTATCTGA CCGCAGGCAG CGAAGGGCGC GAGCCGGAAA GTGAAAGTGG GGCCTTTTAT
GCCCGTCTGC GCCATGAACG TTACCTCAAT AATCAGGCGC GCTTCGCCGG GGTGGCCAAT
GCGGCGGCAC TGGCACCGGG TCAGGAACTG AACGTCACCG GCAACGACGT GCCAGCACAG
TTTGGTAAAG GGGTGATAAT CACCCGCATC ACCAGCCATG CCCGCCGCGA CCGCAGCTAT
GAAGTACATT TTGAAGCCAT TCCTTACTCC GAGGATTATT GCTTCCGTCC GGCGCTGATC
CGCAAGCCGA CCATGGCCGG GACCTTGCCG GCGCGGGTGA CCAGTACCAC TGCGAACGAC
ACTTATGGTC ATATCGACAA AGACGGGCGC TACCGTGTCA ACCTGATGTT CGACCGTGAC
AGTTGGGAGT CGGGTTACGA AAGCTTGTGG GTCCGTCAGG CCCGCCCGTA TGCGGGTGAC
AGCTACGGCC TGCACCTGCC GCTGCTGGCA GGCACCGAAG TGGCGATCGC GTTTGAAGAC
GGCAACCCGG ACCGGCCGTA TATCGCCTAT GTGCTACACG ACTCGGCGCA CGGCGACCAT
GTCACCATCA GCAACTACAA ACGCAACGTA CTGCGTACTC CGTCGAATAA CAAACTGCGC
CTGGAGGATG AACGGGGTAA AGAGCACATC AAGCTCAGCA CCGAATATGG CGGCAAAAGC
CAACTGAATC TGGGGCATTT GGTTGATAAC GAGAAACAGC CTCGGGGTGA GGGTTTTGAG
CTGCGCACCG ACAGTTTTGG TGTATTACGG GCAGAAAAAG GCCTCTTTAT CACTGCCGAC
GGACAGGCCA AAGCACAGGG GCAGGTGCTG GAGATGCAAC CGGCTATCAG CCTGTTGAAA
AGTGCGCAGG AACAGATGGA GGCCATCTCC GCGGATGCAC AAACCGCCAC CGCCAGTCCG
GCCGACCTAC AGGTACAAAT CAGTCTGTTG CAGCAAAATC TCACTGAACT GAAACAAGCT
GTCCTGTTAC TGAGTGCGCC AAAGGGGATC GCGTTGAGTA GCGGGGAGCA TCTGCAAATG
AGTGCCAGCG ATAACCTGAT TGCCACCGCC GGTAAAAATG CCGACGTCAG CGTCGCGAAA
AATTTCTTTA TCGGCGTAGG CAATACCCTA AGTATCTTCG TCAGAAAGCT GGGGATGAAA
CTAATAGCCA ATCAGGGGCC GATAACAGTC CAGGCGCAGA ATGATCTAAT GGAGTTATTG
GCGCGTAAAG CGATCACCAT TACCAGCACC GAGGATGAGA TAAAAATCAC CGCCAAGAAG
AGGATCACGC TGAATGCGGG AGGCAGTTAT ATCACGCTGG ATGAGAATCG GATCGAGTCA
GGAACGGCGA GGGAATATTT GACTAAGGCA GGGCATTACG GGCGAGTGGA TAAGGCGAAA
TTGGAGACAG TAGTGCCAAC ATTAGCCGTT AAAGCTAAAC CACCCACTCA GAAGTATCCA
TTTTCTTAA
 
Protein sequence
MNTFLPTIRF DHSHHKLVVR DSTAAVDVLG FEGHESLSQP FCYDIQFTSA DKAIGPATML 
MHDASLTLAA PMAEAFGVTV QQTQRVIQGV VTGFKRLSAS KEECRYELSL QPRLALLSRS
HQNGIYQDMS VPQIVEKILR ERHDMRGQDF VFTLAREYPR REQVMQYGED DLTFIRRLLA
EVGIWFRFTA DPKLNIDVVE FYDDQRFYQQ GLTLQAVPPS GMHDSGMESV WDLSSAHQVV
EKSVSTGDYN YRTATADLTA GADITRGDTT TYGEAYHYAD NYLTAGSEGR EPESESGAFY
ARLRHERYLN NQARFAGVAN AAALAPGQEL NVTGNDVPAQ FGKGVIITRI TSHARRDRSY
EVHFEAIPYS EDYCFRPALI RKPTMAGTLP ARVTSTTAND TYGHIDKDGR YRVNLMFDRD
SWESGYESLW VRQARPYAGD SYGLHLPLLA GTEVAIAFED GNPDRPYIAY VLHDSAHGDH
VTISNYKRNV LRTPSNNKLR LEDERGKEHI KLSTEYGGKS QLNLGHLVDN EKQPRGEGFE
LRTDSFGVLR AEKGLFITAD GQAKAQGQVL EMQPAISLLK SAQEQMEAIS ADAQTATASP
ADLQVQISLL QQNLTELKQA VLLLSAPKGI ALSSGEHLQM SASDNLIATA GKNADVSVAK
NFFIGVGNTL SIFVRKLGMK LIANQGPITV QAQNDLMELL ARKAITITST EDEIKITAKK
RITLNAGGSY ITLDENRIES GTAREYLTKA GHYGRVDKAK LETVVPTLAV KAKPPTQKYP
FS