Gene YpsIP31758_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3036 
Symbolwzz 
ID5385752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3417834 
End bp3418985 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content36% 
IMG OID640866042 
Productferric enterobactin transport protein FepE 
Protein accessionYP_001401996 
Protein GI153950685 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3765] Chain length determinant protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.177075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATA AACAGGCACG AAATAACCTT GATAACCCAA TGCCGAATAA CTATGATTTT 
TCAAATGTAT CTTCTTCAAG AAATGAAATT GATCTCTTTG AAATTTTTGG TGTTGTATTT
AAATCAAAGT TCAAGATAAT ATTAATAACA CTATTTTTCT TAATTAGTGG TTTAGTGGTC
TCATATATCC TCCCCCAAAA ATGGACAAGC ACTGCAATAA TAGCTCTTCC TGGTGATGAG
CAAGTTCAAG TTCTGGATGA ACTAATCACA AATCTGACCG TGCTTGATAT AAAGGTTGAT
GTGAGTGCTA ATTATTTGCT GTCAACATTC AAACAAAATT TCGATTCTCA AGATCTCCGT
GAACAATATT TAGTAAATAC TAATTACTTT AAACGTTTGA TGAAGAATAA TCCAGAAGAT
GGTTTGGATA AAAGAGCGTT AATAGAGCGA ATCGTAAATG AAAATATTTC TTCGGTTAAC
CCATTGAAAG ATAAAACCGA GGGTGAAAAT GAATATCGCT ATTATAAATT ATCATATAGT
GCAAGCACAC CGACAGACGC TCGTGACTTG TTGCAAGGCT CTATTAACTA TGTAAATACC
ATCGTTAATG CTGATGTTTT CCGAAAAATA CAGCGAGCAG TGGATTTAGC CAAGGGTATC
GGTACAGATA AATACTCTAT GGAATTGTTG AAAGCTAGAA ATAACCAAAA AGTTAAAATT
GAGCGCTTAA GGTATGCTTC TTCTATCGCT GATGCCGCAG GTGTAAAAAA ACCAGTTTAC
AGCAATGGCT CAGCCATTAG TGATGATCCA GACTTCCCTA TTACTATGGG ATCCGATGCG
CTGAACCGTA AACTGGAAAT AGAGAAGTCA GTTATCGACC TGGCTTCAAT CAATACTGAA
CTTCTAAACC GTAAGTTGTA TTTGGATAAA TTAAATAGGT TAGAAATTCC TAATGTTAAT
ATTGTGCCAT TTAAATATTT GCAACAGCCA ACGGAACCCA CTAAAAGAGA TGCCCCTAAG
CGCGCATTGA TTGTGATTCT GTTTGCCCTG GTCGGTCTTA TGGGTTCTGT CGGTTTTGTT
TTAGTTGAGC ACTTTGTGCG TGAACGGAAG CGAGAAGAAG AGGGGCTTAA GCTCTCTCAA
ACTAAGGAAT AG
 
Protein sequence
MSNKQARNNL DNPMPNNYDF SNVSSSRNEI DLFEIFGVVF KSKFKIILIT LFFLISGLVV 
SYILPQKWTS TAIIALPGDE QVQVLDELIT NLTVLDIKVD VSANYLLSTF KQNFDSQDLR
EQYLVNTNYF KRLMKNNPED GLDKRALIER IVNENISSVN PLKDKTEGEN EYRYYKLSYS
ASTPTDARDL LQGSINYVNT IVNADVFRKI QRAVDLAKGI GTDKYSMELL KARNNQKVKI
ERLRYASSIA DAAGVKKPVY SNGSAISDDP DFPITMGSDA LNRKLEIEKS VIDLASINTE
LLNRKLYLDK LNRLEIPNVN IVPFKYLQQP TEPTKRDAPK RALIVILFAL VGLMGSVGFV
LVEHFVRERK REEEGLKLSQ TKE