Gene Information |
Locus tag | EcSMS35_2768 |
Symbol | recN |
ID | 6144042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2847359 |
End bp | 2849020 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617638 |
Product | recombination and repair protein |
Protein accession | YP_001744799 |
Protein GI | 170683348 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0497] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00634] DNA repair protein RecN |
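
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for EcSMS35_2768 (recN).
length_bp = gene_length(2847359, 2849020)
print(length_bp)                  # 1662
print(protein_length(length_bp))  # 553
```

Both results match the Gene Length (1662 bp) and Protein Length (553 aa) rows.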
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00054696 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.107417 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGCAC AACTGACCAT CAGCAACTTT GCTATCGTTC GTGAGCTTGA GATTGATTTT CATAGCGGCA TGACCGTAAT AACTGGCGAG ACCGGCGCGG GTAAATCTAT TGCAATAGAT GCCCTCGGTC TTTGTCTCGG TGGTCGCGCT GAAGCCGACA TGGTGCGTAC CGGCGCTGCT CGCGCTGACC TGTGCGCCCG TTTTTCTCTG AAAGATACGC CAGCGGCCCT GCGCTGGCTG GAAGAAAACC AGCTTGAAGA CGGGCATGAA TGTTTGCTTC GTCGCGTAAT CAGCAGCGAT GGTCGCTCCC GTGGTTTTAT CAACGGTACA GCGGTTCCTC TTTCACAACT GCGCGAACTG GGTCAGTTGC TGATTCAGAT CCATGGTCAG CACGCTCATC AGTTACTCAC CAAACCCGAG CACCAAAAAT TCCTGCTTGA TGGCTATGCC AATGAAACCT CTCTACTCCA GGAAATGACC GCACGTTATC AGTTGTGGCA TCAAAGCTGC CGTGACCTCG CGCATCATCA ACAGTTAAGT CAGGAACGCG CCGCCCGTGC AGAACTGCTG CAATACCAAT TAAAAGAACT TAACGAATTT AATCCGCAGC CCGGAGAGTT TGAGCAAATC GACGAAGAGT ACAAACGTCT GGCGAACAGC GGTCAATTGC TGACCACCAG CCAGAATGCA TTGGCATTAA TGGCCGACGG TGAAGACGCA AACCTGCAAA GTCAGCTTTA CACGGCTAAA CAACTGGTGA GCGAATTGAT TGGCATGGAC AGCAAACTGT CCGGCGTACT TGATATGCTG GAAGAAGCTA CCATCCAGAT TGCTGAAGCC AGCGATGAAC TGCGCCACTA CTGCGATCGT CTGGACCTCG ATCCTAACCG ACTGTTTGAA CTTGAACAGC GCATCTCAAA ACAGATTTCG CTGGCACGTA AACATCACGT CAGCCCTGAG GCATTGCCAC AGTATTACCA GTCGCTACTG GAAGAACAGC AGCAACTGGA CGATCAGGCC GACTCACAAG AAACGCTCGC GCTGGCGGTA ACGAAACATC ATCAGCAGGC ACTGGAAACC GCGAACGCAT TACACCAGCA ACGCCAGCAC TATGCAGAAG AACTTGCACA GCTGATCACC GACAGTATGC ATGCACTCTC TATGCCGCAT GGGCAGTTTA CGATTGATGT TAAATTTGAC GAGCATCACC TGGGCGCTGA CGGCGCTGAT CGTATTGAAT TTCGGGTAAC CACCAACCCA GGTCAGCCAA TGCAGCCTAT TGCCAAAGTC GCATCCGGTG GTGAATTGTC ACGCATCGCA TTGGCAATTC AGGTCATCAC GGCGCGTAAA ATGGAAACCC CGGCACTGAT TTTTGATGAA GTGGATGTAG GGATTAGCGG CCCAACAGCG GCAGTTGTTG GCAAACTGCT GCGTCAACTC GGCGAATCAA CTCAGGTGAT GTGTGTTACC CACCTGCCAC AAGTCGCGGG ATGTGGTCAT CAACACTATT TTGTCAGCAA AGAAACCGAT GGTGCGATGA CAGAAACGCA TATGGAGTCG CTGGACAAAA AAGCGCGGTT ACAAGAGCTG GCGCGCCTGC TTGGTGGCAG TGAAGTCACA CGTAACACAC TGGCGAATGC GAAAGAACTG CTTGCGGCGT AA
|
Protein sequence | MLAQLTISNF AIVRELEIDF HSGMTVITGE TGAGKSIAID ALGLCLGGRA EADMVRTGAA RADLCARFSL KDTPAALRWL EENQLEDGHE CLLRRVISSD GRSRGFINGT AVPLSQLREL GQLLIQIHGQ HAHQLLTKPE HQKFLLDGYA NETSLLQEMT ARYQLWHQSC RDLAHHQQLS QERAARAELL QYQLKELNEF NPQPGEFEQI DEEYKRLANS GQLLTTSQNA LALMADGEDA NLQSQLYTAK QLVSELIGMD SKLSGVLDML EEATIQIAEA SDELRHYCDR LDLDPNRLFE LEQRISKQIS LARKHHVSPE ALPQYYQSLL EEQQQLDDQA DSQETLALAV TKHHQQALET ANALHQQRQH YAEELAQLIT DSMHALSMPH GQFTIDVKFD EHHLGADGAD RIEFRVTTNP GQPMQPIAKV ASGGELSRIA LAIQVITARK METPALIFDE VDVGISGPTA AVVGKLLRQL GESTQVMCVT HLPQVAGCGH QHYFVSKETD GAMTETHMES LDKKARLQEL ARLLGGSEVT RNTLANAKEL LAA
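
The protein sequence can be derived from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code), and the GC content can be recomputed from the same string. Below is a minimal sketch, assuming the sequence is given as in this record (uppercase, in space-separated blocks of ten). Table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons, so the sketch builds the standard codon map and does not model alternative starts. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI's T, C, A, G ordering; amino-acid assignments are the
# same for translation table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGTTGGCAC AACTGACCAT CAGCAACTTT"))  # MLAQLTISNF
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its spaces are removed, and `gc_fraction` on the same input can be compared with the GC content row.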