Gene YpsIP31758_3419 details


Gene Information

Locus tag: YpsIP31758_3419
Symbol: rapA
ID: 5386253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3847096
End bp: 3850002
Gene Length: 2907 bp
Protein Length: 968 aa
Translation table: 11
GC content: 52%
IMG OID: 640866432
Product: ATP-dependent helicase HepA
Protein accession: YP_001402374
Protein GI: 153949731
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
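
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and only for illustration.

```python
# Values copied from the record above.
START_BP = 3847096
END_BP = 3850002


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2907
print(protein_length(length_bp))  # 968
```

Under these assumptions the computed values match the reported Gene Length (2907 bp) and Protein Length (968 aa).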


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGACACAG AAAGCGAACT TGGATTGGGT 
ACTGTCGTTG CCATTGACGT GCGCATGATC ACCTTACTGT TCCCTGCAAC CGGTGAAAAC
CGCCTTTACG CCAGAAATGA CTCGCCAATC ACCCGGGTCA TGTTCAATCC GAGCGATACC
ATCACTCACC ACGAAGGCTG GCAGCTGAAA GTGGAAGAAG TGACTCAAGA AAATGGGCTG
ATTACCTATA TCGGTACCCG TTTGGATACT GAGGAAACCG GCGTCGCCAT GCGTGAAGTT
TTGCTCGATA GCAAACTGAC TTTTAGTAAA CCACAGGATC GCTTATTCGC CGGTCAGATC
GATCGCATGG ATCGCTTTGC CCTGCGTTTT CGCGCCCGTA AATACCAAAG CGAGCAGTTC
CGGTTGCCGT GGAGCGGCCT GCGTGGTATC CGTGCCAGCC TGATCCCACA CCAGTTACAT
ATCGCTTATG AAGTGGGTCA GCGCCACGCG CCACGGGTCT TACTGGCCGA TGAAGTGGGG
TTGGGTAAAA CCATCGAAGC CGGGATGATT ATCCACCAGC AACTGCTGGC TGGCCGTGCT
GAACGGGTGC TGATCGTGGT ACCGGAGAGC CTGCAACACC AGTGGCTAGT AGAGATGTTA
CGCCGCTTCA ACCTGCGCTT CTCACTGTTT GATGACAGCC GTTATTCCGA AGCCTTGTTG
GACAGTAGCA ATCCGTTTGA TACCGAACAG ATGGTGATTT GTTCGCTGGA TTTTGTTCGT
CGGAATAAGC AGCGGTTAGA ACAACTGGCC GATGCTTCCT GGGATCTGCT GGTGGTGGAT
GAAGCGCACC ATCTGGCCTG GAGTGAAGAG GCTCCAAGCC GCGAATATCA AGTGATTGAA
CAGTTGGCCG AGCATATCCC CGGCGTGCTG TTATTGACAG CAACACCAGA GCAATTGGGC
CAACAGAGTC ACTTTGCCCG CCTGCGTTTG CTGGATCCAG ATCGCTTCCA TGATTACGAA
GAGTTCGTCA ATGAACAACA GAAATACCGG CCTATCGCCG ATGCCGTCAC CTTGCTGTTA
GGGGGCGAGC GTTTAACCGA TGATAAGCTG AATCTGTTAG GTGAACTGAT TGATGAACAA
GATATTGAGC CACTACTGAA AGCGGCCAAT AGCCAAAGTG AAGACAGCGA AGCGGCGCGT
CAAGAATTAG TTACCATGCT GATGGATAGG CACGGTACTA GCCGGGTGCT GTTCCGTAAT
ACCCGTAATG GGGTAAAAGG CTTCCCGCAC CGCGTCTTAC ACCAAATTAA GCTGCCATTG
CCAACACAGT ACCAAACTGC AATTAAGGTC TCTGGCATCA TGGGGGCGAA GAAAACCCTC
GACGCTCGTG CAAAAGATAT GCTTTATCCC GAGCAAATCT ATCAGGAGTT TGAAGGCGAA
AACGCCACCT GGTGGAACTT TGACCCACGG GTTGAATGGC TACTGAACTA TTTGGTTGCC
AACCGGGGTG AGAAGGTGTT GGTGATCTGC GCACAAGCCG CTACAGCTTT GCAGCTTGAA
CAAGTGTTAC GTGAGCGTGA AGCTATTCGG GCCGCCGTCT TCCATGAAGG TTTATCGCTG
ATTGAACGTG ACCGTGCCGC CGCCTATTTT GCCTCAGAAG AAGATGGTGC TCAGGTCTTA
CTGTGTTCCG AAATTGGTTC AGAGGGGCGT AACTTCCAAT TTGCCTGCCA ACTGGTCATG
TTCGATTTGC CGTTCAACCC CGATCTCCTG GAGCAACGTA TTGGCCGTTT GGATCGTATC
GGCCAGAACC GTGAAATCCA GATTATGGTG CCTTATTTGG AAGACACCGC GCAGGCAATA
CTGGTTCGTT GGTACCACGA AGGGTTGGAT GCGTTCGAAC ACACCTGCCC AACAGGCCGT
ACCATTTATG ACAGCAGCTA TCAGGAACTG ATTAGCTATT TGGCCACGCC AAGTGAGCAG
GAAGGGTTGG ATGAGTTTAT CCACACCTGC CGTCAGCAGC ACGAAGGGTT AAAACTTCAG
TTGGAACAGG GCCGTGACCG CTTACTGGAG ATGCACTCTA ACGGTGGCGA ACATGGGCAG
GAGCTGGCAC AGAGCATTGC CGAACAAGAT AATGACATCA ATTTAGTCAG TTTTGCACTC
AACCTGTTCG ATATTGTCGG GATCAACCAG GAAGATCGTA GCGATAACCT GATCGTGCTG
ACCCCGTCCG ATCACATGCT GGTGCCCGAT TTCCCCGGTT TGCCACCAGA TGGCTGCACC
GTCACCTTTG ATCGTGAACA GGCACTCTCG CGGGAAGATG CCCAGTTTGT CAGTTGGGAA
CACCCCATCA TCCGTAATGG GTTGGATTTG ATCCTGTCTG GCGATACCGG CAGTTGCGCG
GTGTCTTTAT TGAAAAATAA AGCCTTACCC GTCGGTACCC TGCTGGCTGA GTTGGTTTAC
GTGGTCGAGG CACAAGCACC GAAACACTTG CAACTGACCC GTTTCTTGCC ACCGACCCCC
GTGCGTATGC TCATGGATCG CAACGGCACT AACCTGGCGG CACAGGTTGA ATTTGAAAGT
TTCAATCGCC AACTGAATGC GGTAAACCGC CATACCTCCA GCAAGCTGGT CAATGCGGTT
CAGCAAGAAG TTCACACCAT GTTGCAACAA GCCGAAGCAC TGGTAGAAGC GCAAGCGCAG
GCTCTGATTG AAACGGCAAA ACGCGAGGCC GATGATAAGT TGAGTACTGA ACTGGCGCGT
CTGGAAGCGT TGAAAGCGGT TAACCCGAAT ATTCGTGATG ACGAAATAGA GGCGCTTGAG
CATAACCGTA AGATGGTGCT GGAAAACCTG AATCAAGCAG GCTGGCGTTT AGATGCTATC
CGGCTGGTGG TGGTAACACA TCAGTAG
 
Protein sequence
MPFTLGQRWI SDTESELGLG TVVAIDVRMI TLLFPATGEN RLYARNDSPI TRVMFNPSDT 
ITHHEGWQLK VEEVTQENGL ITYIGTRLDT EETGVAMREV LLDSKLTFSK PQDRLFAGQI
DRMDRFALRF RARKYQSEQF RLPWSGLRGI RASLIPHQLH IAYEVGQRHA PRVLLADEVG
LGKTIEAGMI IHQQLLAGRA ERVLIVVPES LQHQWLVEML RRFNLRFSLF DDSRYSEALL
DSSNPFDTEQ MVICSLDFVR RNKQRLEQLA DASWDLLVVD EAHHLAWSEE APSREYQVIE
QLAEHIPGVL LLTATPEQLG QQSHFARLRL LDPDRFHDYE EFVNEQQKYR PIADAVTLLL
GGERLTDDKL NLLGELIDEQ DIEPLLKAAN SQSEDSEAAR QELVTMLMDR HGTSRVLFRN
TRNGVKGFPH RVLHQIKLPL PTQYQTAIKV SGIMGAKKTL DARAKDMLYP EQIYQEFEGE
NATWWNFDPR VEWLLNYLVA NRGEKVLVIC AQAATALQLE QVLREREAIR AAVFHEGLSL
IERDRAAAYF ASEEDGAQVL LCSEIGSEGR NFQFACQLVM FDLPFNPDLL EQRIGRLDRI
GQNREIQIMV PYLEDTAQAI LVRWYHEGLD AFEHTCPTGR TIYDSSYQEL ISYLATPSEQ
EGLDEFIHTC RQQHEGLKLQ LEQGRDRLLE MHSNGGEHGQ ELAQSIAEQD NDINLVSFAL
NLFDIVGINQ EDRSDNLIVL TPSDHMLVPD FPGLPPDGCT VTFDREQALS REDAQFVSWE
HPIIRNGLDL ILSGDTGSCA VSLLKNKALP VGTLLAELVY VVEAQAPKHL QLTRFLPPTP
VRMLMDRNGT NLAAQVEFES FNRQLNAVNR HTSSKLVNAV QQEVHTMLQQ AEALVEAQAQ
ALIETAKREA DDKLSTELAR LEALKAVNPN IRDDEIEALE HNRKMVLENL NQAGWRLDAI
RLVVVTHQ
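
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC content. The following is a minimal sketch, not the method used by the database: it assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code, and the helper names (`clean`, `gc_content`, `translate`) are hypothetical. Only the first 60 bases of the gene sequence are included as sample input; the full sequence can be pasted in the same 10-base block layout.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard code, assumed here
# to match translation table 11 for internal codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = "ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGACACAG AAAGCGAACT TGGATTGGGT"
print(translate(first_line))  # MPFTLGQRWISDTESELGLG
print(round(gc_content(first_line), 1))
```

The translated fragment matches the first 20 residues of the protein sequence. Applied to the full gene sequence, `translate` would be expected to yield the 968-residue protein and `gc_content` a value near the reported 52%.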