Gene YpAngola_A2994 details

Gene Information

Locus tag: YpAngola_A2994
Symbol:
ID: 5801466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3162440
End bp: 3163525
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 59%
IMG OID: 641340833
Product: hypothetical protein
Protein accession: YP_001607363
Protein GI: 162418124
COG category: [S] Function unknown
COG ID: [COG3520] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03347] type VI secretion protein, VC_A0111 family
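
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the values listed) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3162440
end_bp = 3163525

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1086 361
```

Both results agree with the Gene Length (1086 bp) and Protein Length (361 aa) fields.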


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.000000723693
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAAGAG AATCATCAGT CCCGCCTTCC CGGCTAATCA CCCGGCTTAA GCGCGATATC 
GGCCAGATCA ATTTTTACCG TTTCTGTCAG TTGCTGGAGC AACAACCGGG AACCTTGCCG
CTCGGCAGTA CCAACAGCCC GGCGGATGAT CCGGTGCGTT TTCGTCCGCA TCCCGGCATG
GGTTTTCCGG TCAGTGAACT GAAGGCGCTG GAAAACGACC CGCTGCATCC GGAAGCCCCG
TTGACCGCGC GCACCACCTT TATGGGGTTG TACGGGGTGG ATTCGCCTTT GCCGACGGCC
TATATCGACG ATATCACCCA GCAGCGCGAA GGGCATGAGG CGCTGGAAGG GTTCCTCGAT
ATTTTCAACC ACCGTTTTAT GACGCAGTTT TATCGCATCT GGCGCAAATA TTCCTATCCG
GCCACTTTTG AGTCGGGCGG CACAGACAAT ACCTCTCAGA GCCTGTTGGG GCTGATAGGC
ATGGGTATTC CTGGCTCACA GCAGCATTTT GCCACCCCGA CATCGCGCTT TCTGGCCTTG
CTGGGGGTGA TGCGACAACC GGGGCGCACT GCCGAAGGGG TACAGGCGCT GGTCCGGCTG
CTGGCCCCGT TTACCGAGGC GGATGTCACG CCCCATTGCC TGCGCACGGT GCGCCTGACT
TCTCCGATGG CGTTTGCCGA TGAAGGGGCC AACTGGCTGG ACGGCTACAC CGTGCTGGGT
GATGAGGCCA TTGATGCCAA CAGTCAGTTG CTGATTTCGC TGCGCACGGC AGACCGCGAC
GAGGCCGCTA ACTGGTTACC GGATGGCCCG CTGTATACCG ATTTTCTGGT GCTGTTGCGG
GTCTATCTGG GCTGGCGTTA CCGGGCCTGT ATCCAACTGA CGGTCGCCAC CCGGTTGCTG
CCGGCGTTGG TGTTGGATGA GACACCGATA AGGCTGGGAT TAACCGGCGT GCTGGGGCTG
GATGACGCGA CGCTGTCAGA CGATATACCT GAATATTTTA CCGTGGCGCT GGGCCACTAC
CGTGGGCTGG CTCCACAGCA ACATCAAGAA GGAAACCGAC GTGTTAACTA CCGTTTGGAA
AAATAA
 
Protein sequence
MGRESSVPPS RLITRLKRDI GQINFYRFCQ LLEQQPGTLP LGSTNSPADD PVRFRPHPGM 
GFPVSELKAL ENDPLHPEAP LTARTTFMGL YGVDSPLPTA YIDDITQQRE GHEALEGFLD
IFNHRFMTQF YRIWRKYSYP ATFESGGTDN TSQSLLGLIG MGIPGSQQHF ATPTSRFLAL
LGVMRQPGRT AEGVQALVRL LAPFTEADVT PHCLRTVRLT SPMAFADEGA NWLDGYTVLG
DEAIDANSQL LISLRTADRD EAANWLPDGP LYTDFLVLLR VYLGWRYRAC IQLTVATRLL
PALVLDETPI RLGLTGVLGL DDATLSDDIP EYFTVALGHY RGLAPQQHQE GNRRVNYRLE
K
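
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that relationship on the first 30 bases of the gene. It is a hedged example, not the database's own pipeline: the `translate` helper and `CODON_TABLE` name are hypothetical, and the code relies on the fact that translation table 11 assigns sense and stop codons the same way as the standard code. Table 11 also permits alternative start codons, which this sketch ignores; that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses these same amino-acid assignments; alternative start codons
# are NOT handled by this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for display
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed above.
print(translate("ATGGGAAGAG AATCATCAGT CCCGCCTTCC"))  # MGRESSVPPS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would be the natural way to compare against the whole protein listing.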