Gene EcE24377A_1409 details


Gene Information

Locus tag: EcE24377A_1409
Symbol:
ID: 5587462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1412677
End bp: 1414197
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 44%
IMG OID: 640925104
Product: putative recombinase
Protein accession: YP_001462511
Protein GI: 157158350
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
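
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1412677
end_bp = 1414197
gene_length_bp = 1521
protein_length_aa = 506


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def residues_from_cds(length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the final codon is the stop, not a residue


computed_length = span_length(start_bp, end_bp)
computed_protein = residues_from_cds(computed_length)

print(computed_length, computed_length == gene_length_bp)      # 1521 True
print(computed_protein, computed_protein == protein_length_aa)  # 506 True
```

Under these assumptions the span, gene length, and protein length agree: 1521 bp is 507 codons, of which 506 encode residues.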


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGTG TAATAACATA CTTGAGATTC TCATCAGCCA TACAAGGCGC TGAAGGTGCA 
GATTCAACCA GACGACAAAA TGACCTGTTC AAGCAATGGT TGAAGAAGAA TAGCGATGCT
CAAGTAGTAG CGTCTTTCAG TGATGAAGGG TTGAGTGGTT ACAAAGGTAA GCATCTTACT
GGTCAGTTTG GTGACATGTT AGCCCGTATT GAGTCTGGGG AGTTTCCAGA AGGTACACTT
CTGTTAGTCG AAGCTATCGA CCGCATAGGC CGCCTTGAAC ATCTTGAAAC AGAAGCCTTG
ATGAATCGCA TTATTGCTCA TGGTATCGAG ATTCACACTC TACAGGATGG GCTAATCTAC
ACAAGGGATG CTCTATCCGA TGATTTAGGA ATCTCAATCA TCCAGCGCGT TAAAAGCTAC
GTAGCTCATC AAGAGTCTAA GCAGAAGTCT TTCCGTGTTA GCCAGAAGTG GAAACAACGT
GCAAAGCTTG CCCTTGCTGG TGAACAACGT TTAACAAAGA TGGTTCCCGG ATGGATTGAC
CCCGATACTT TTAAACTCAA TGAACACGCT GAGACTGTAA GACTGATTTT CAAGCTGCTG
CTAAGTGGTG AAAGTCTGCA TAACATCGCC CGTCACCTAC AGGCTAATAA CATTAGTTCA
TTCTCACGGC GTAAAGATGC TAACGGGTTC AGTGTTCACA GTGTTCGTAC TGTTTTACGC
TCTGAGTCAG TGATAGGGAC ACTACCAGCA TCACAGCGCA ATGACCGCCC CGCTATACCG
AACTACTACG AAGCCGCTAT AGATGCTTCA ACGTTCAATA AAGCTCAAGA AATCCTCGAT
AAAAATCGTA AAGGTCGCAC ACCTGCAAGT GATAACCCAT TAACGATTAA CATCTTCAAG
GGATTATTCC GGTGTCAGTG TGGGGCTAGT GTTCACCCTA CAGGGACTAA GAATAAGTAT
GCAGGGGTTT ACAGGTGCAA TAACAATCCT GACGGTCGCT GTGATGTTCC ACCGTTGAAG
CGTAAACCGT TTGATAAGTG GATGATTGAT AATTTTCTGG GGATGATTGA CGTGGGGAAT
GATGGAGAAG CAGAGGGGAA GATTGCATCT CTACAGCATG AGGTTGAAAT TGTCACAACC
AGAATCAAGA AAGCTACCGC CCTACTTCTT GAGATGGATG ATATTACAGA GTTGAAAGCA
CAGGTGAAGG AACTGAACCA GAAGCGCACA GAACTACAGA CCACGATTGA TAGCATGAGG
CGTAAAACTT CACTCAGTGA CAAGGGATTA CCCCAACTCA AAGACATTGA CCTTATGACT
AAAGCGGGTC GTGTTGAGTG TCAGTTGATT CTGTCCAAGC ATCTAAAAGG GCTTACATTG
GGTAAGGATT CGGTAACTGT AACGCTACAG AACGACACTG AAATAATTGT TCCTACAGAC
CCGCTACCTC TAAATGATGG AACATCTATC TTTGAAATTG CTGAAAAAGA GCTACTAGAA
ATAGACGCTT ATCAACTGTA G
 
Protein sequence
MRSVITYLRF SSAIQGAEGA DSTRRQNDLF KQWLKKNSDA QVVASFSDEG LSGYKGKHLT 
GQFGDMLARI ESGEFPEGTL LLVEAIDRIG RLEHLETEAL MNRIIAHGIE IHTLQDGLIY
TRDALSDDLG ISIIQRVKSY VAHQESKQKS FRVSQKWKQR AKLALAGEQR LTKMVPGWID
PDTFKLNEHA ETVRLIFKLL LSGESLHNIA RHLQANNISS FSRRKDANGF SVHSVRTVLR
SESVIGTLPA SQRNDRPAIP NYYEAAIDAS TFNKAQEILD KNRKGRTPAS DNPLTINIFK
GLFRCQCGAS VHPTGTKNKY AGVYRCNNNP DGRCDVPPLK RKPFDKWMID NFLGMIDVGN
DGEAEGKIAS LQHEVEIVTT RIKKATALLL EMDDITELKA QVKELNQKRT ELQTTIDSMR
RKTSLSDKGL PQLKDIDLMT KAGRVECQLI LSKHLKGLTL GKDSVTVTLQ NDTEIIVPTD
PLPLNDGTSI FEIAEKELLE IDAYQL
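
As a spot check that the protein sequence follows from the gene sequence, the first 30 and last 21 nucleotides listed above can be translated by hand. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard compact ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons), and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring the spaces used for layout."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 21 nt of the gene sequence, copied from the record.
first_block = "ATGAGAAGTG TAATAACATA CTTGAGATTC"
last_block = "ATAGACGCTT ATCAACTGTA G"

print(translate(first_block))  # MRSVITYLRF
print(translate(last_block))   # IDAYQL*
```

Both fragments match the start (`MRSVITYLRF`) and end (`IDAYQL`) of the protein sequence, with the final `TAG` codon read as the stop.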