Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3005 |
Symbol | recN |
ID | 6272544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2802667 |
End bp | 2804328 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641726941 |
Product | recombination and repair protein |
Protein accession | YP_001881406 |
Protein GI | 187731758 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0497] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00634] DNA repair protein RecN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000000658292 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGGCAC AACTGACCAT CAGCAACTTT GCTATCGTTC GTGAGCTTGA GATTGATTTT CATAGCGGCA TGACCGTAAT AACTGGCGAG ACCGGCGCGG GTAAATCTAT TGCAATAGAT GCCCTCGGTC TTTGTCTCGG TGGTCGCGCT GAAGCCGACA TGGTGCGTAC CGGCGCTGCT CGCGCTGACC TGTGCGCCCG TTTTTCTCTG AAAGATACGC CAGCGGCTCT GCGCTGGCTG GAAGAAAATC AGCTTGAAGA CGGGCATGAA TGTTTGCTTC GTCGCGTGAT CAGCAGCGAT GGTCGCTCCC GTGGTTTCAT CAACGGTACA GCTGTTCCTC TGTCACAACT GCGCGAACTG GGTCAGTTGC TGATTCAGAT CCATGGTCAG CACGCTCATC AATTACTCAC CAAACCTGAG CACCAAAAAT TCCTGCTTGA TGGCTATGCC AATGAAACCT CTCTACTGCA GGAAATGACC GCACGTTATC AGTTGTGGCA TCAAAGCTGC CGTGACCTCG CGCATCATCA ACAGTTAGGT CAGGAACGCG CCGCCCGTGC GGAACTGCTG CAATACCAAT TAAAAGAACT TAACGAATTT AATCCGCAGC CCGGAGAGTT TGAACAAATC GACGAAGAGT ACAAACGTCT GGCGAACAGC GGTCAATTGC TGACCACCAG CCAGAATGCA TTGGCATTAA TGGCCTACGG TGAAGACGCA AACCTGCAAA GTCAGCTTTA CACGGCTAAA CAACTGGTGA GCGAATTGAT TGGCATGGAC AGCAAACTGT CCGGCGTACT TGATATGCTG GAAGAAGCTA CCATCCAGAT TGCTGAAGCC AGCGATGAAC TGCGCCACTA CTGCGATCGT CTGGATCTCG ATCCCAACCG ACTATTTGAA CTTGAACAGC GCATCTCAAA ACAGATTTCG CTGGCACGTA AACATCACGT CAGCCCTGAG GCATTGCCAC AGTATTACCA GTCGCTACTG GAAGAACAGC AGCAACTGGA CGATCAGGCC GACTCACAAG AAACGCTTGC GCTGGCGGTA ACGAAACATC ATCAGCAGGC ACTGGAAATC GCGCGCGCAT TACACCAACA ACGCCAGCAA TATGCAGAAG AACTTGCACA GCTGATCACC GACAGTATGC ATGCGCTCTC AATGCCGCAT GGGCAGTTTA CGATCGATGT TAAATTTGAC GAGCATCACC TGGGCGCTGA CGGTGCCGAT CGTATTGAGT TTCGGGTAAC CACCAACCCA GGTCAGCCAA TGCAGCCTAT TGCCAAAGTC GCATCCGGTG GTGAATTGTC CCGCATCGCA CTGGCAACCC AGGTCATCAC GGCGCGTAAA ATGGAAACCC CGGCACTGAT TTTTGATGAA GTGGATGTAG GGATTAGCGG TCCAACAGCG GCAGTTGTCG GCAAACTGCT GCGTCAACTC GGCGAATCAA CTCAGGTGAT GTGTGTTACC CACCTGCCAC AAGTCGCGGG ATGTGGTCAT CAACACTATT TTGTCAGCAA AGAAACCGAT GGTGCGATGA CAGAAACGCA TATGCAATCC CTGAATAAAA AAGCGCGGTT ACAAGAGCTG GCGCGCCTGC TTGGTGGCAG TGAAGTCACA CGTAATACAC TGGCGAATGC GAAAGAACTG CTTGCAGCGT AA
|
Protein sequence | MLAQLTISNF AIVRELEIDF HSGMTVITGE TGAGKSIAID ALGLCLGGRA EADMVRTGAA RADLCARFSL KDTPAALRWL EENQLEDGHE CLLRRVISSD GRSRGFINGT AVPLSQLREL GQLLIQIHGQ HAHQLLTKPE HQKFLLDGYA NETSLLQEMT ARYQLWHQSC RDLAHHQQLG QERAARAELL QYQLKELNEF NPQPGEFEQI DEEYKRLANS GQLLTTSQNA LALMAYGEDA NLQSQLYTAK QLVSELIGMD SKLSGVLDML EEATIQIAEA SDELRHYCDR LDLDPNRLFE LEQRISKQIS LARKHHVSPE ALPQYYQSLL EEQQQLDDQA DSQETLALAV TKHHQQALEI ARALHQQRQQ YAEELAQLIT DSMHALSMPH GQFTIDVKFD EHHLGADGAD RIEFRVTTNP GQPMQPIAKV ASGGELSRIA LATQVITARK METPALIFDE VDVGISGPTA AVVGKLLRQL GESTQVMCVT HLPQVAGCGH QHYFVSKETD GAMTETHMQS LNKKARLQEL ARLLGGSEVT RNTLANAKEL LAA
|
| |