Gene Information |
Locus tag | B21_00641 |
Symbol | speF |
ID | 8114471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 677544 |
End bp | 679742 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644846914 |
Product | hypothetical protein |
Protein accession | YP_002998487 |
Protein GI | 251784183 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1982] Arginine/lysine/ornithine decarboxylases |
TIGRFAM ID | |
|
|
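The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a complete CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(677544, 679742)
print(length)                              # 2199
print(expected_protein_length_aa(length))  # 732
```

Under these assumptions both results agree with the listed Gene Length (2199 bp) and Protein Length (732 aa).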
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.94656 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAT TAAAAATTGC GGTTAGTGAT TCTTGCCCGG ACTGTTTTAC CACGCAGCGA GAATGTATCT ACATTAATGA AAGTCGTAAT ATCGATGTGG CGGCAATAGT TTTATCGCTC AACGATGTTA CATGCGGAAA ACTCGATGAA ATCGATGCCA CGGGTTATGG CATCCCGGTA TTTATTGCTA CTGAAAATCA AGAACGTGTA CCCGCAGAGT ATTTGCCCCG TATTTCGGGT GTCTTTGAGA ATTGCGAATC GCGACGAGAA TTTTATGGTC GCCAGTTAGA AACCGCTGCC AGCCATTATG AAACTCAACT GCGCCCACCT TTCTTCCGCG CACTGGTCGA TTATGTCAAT CAAGGTAACA GCGCGTTTGA TTGCCCTGGT CATCAGGGCG GCGAATTTTT CCGTCGCCAT CCGGCAGGGA ATCAGTTTGT GGAATACTTT GGTGAGGCGC TGTTCCGTGC CGACTTGTGC AACGCCGACG TAGCGATGGG CGATCTGCTG ATTCACGAAG GCGCGCCATG CATTGCACAG CAACATGCGG CAAAAGTGTT TAATGCCGAT AAAACCTACT TCGTTTTAAA TGGCACTTCA TCTTCTAACA AAGTGGTTTT AAACGCCCTG CTAACACCGG GTGATCTGGT GCTGTTTGAT CGCAATAACC ACAAATCTAA CCACCACGGA GCGTTGCTAC AGGCTGGTGC AACACCGGTT TATCTGGAAA CGGCACGTAA CCCGTATGGC TTTATCGGTG GCATTGATGC GCACTGTTTT GAAGAAAGTT ACCTGCGTGA GCTGATCGCG GAAGTCGCAC CGCAGCGGGC AAAAGAGGCT CGTCCTTTCC GCCTCGCTGT GATTCAGTTA GGCACCTACG ACGGTACGAT TTATAACGCC CGCCAAGTGG TGGATAAAAT TGGTCATCTG TGTGACTACA TCCTGTTTGA CTCAGCATGG GTCGGCTATG AACAGTTTAT TCCGATGATG GCGGACTGTT CGCCGCTGTT GCTGGATCTT AATGAGAACG ATCCGGGTAT TCTGGTTACG CAATCTGTGC ATAAACAACA GGCTGGTTTT TCTCAGACTT CACAAATTCA TAAAAAAGAC AGCCACATCA AAGGGCAACA GCGTTATGTA CCGCACAAAC GCATGAACAA CGCCTTTATG ATGCACGCCT CCACCAGCCC GTTCTATCCG CTGTTTGCCG CACTGGATAT CAACGCCAAA ATGCATGAAG GTGTCAGCGG TCGTAATATG TGGATGGATT GTGTGGTAAA TGGCATTAAT GCCCGCAAAC TGATCCTCGA TAACTGTCAG CATATTCGTC CGTTCGTACC TGAACTGGTG GATGGTAAAC CCTGGCAGTC GTATGAAACA GCGCAAATTG CGGTTGATCT GCGCTTCTTC CAGTTTGTAC CAGGGGAACA CTGGCATTCT TTTGAAGGCT ATGCAGAGAA TCAATACTTT GTCGATCCAT GCAAACTGTT GCTGACAACC CCAGGTATTG ATGCACGTAA CGGCGAATAT GAAGCGTTCG GTGTACCCGC GACGATTCTT GCTAACTTCC TGGGCGAAAA TGGCGTAGTG CCGGAAAAAT GCGATCTTAA CTCCATCCTC TTCCTGCTGA CTCCGGCAGA AGATATGGCC AAACTTCAGC AACTTGTTGC CCTGCTGGTA CGCTTCGAAA AACTGCTTGA GTCCGACGCG CCATTAGCAG AAGTGCTACC TTCCATCTAC AAACAGCATG AAGAGCGCTA CGCCGGTTAT ACCCTGCGTC AGTTGTGTCA GGAAATGCAT 
GATTTGTATG CCCGCCACAA CGTGAAACAA CTGCAAAAAG AGATGTTCCG TAAGGAGCAC TTCCCACGCG TCAGCATGAA TCCGCAAGAA GCCAACTACG CCTATTTACG CGGTGAAGTG GAACTGGTTC GTCTGCCGGA TGCAGAAGGC CGTATCGCTG CCGAAGGTGC GCTTCCTTAT CCTCCGGGTG TGCTGTGTGT TGTTCCGGGT GAAATCTGGG GTGGTGCTGT TCTGCGTTAC TTCAGCGCTC TGGAAGAAGG GATCAACCTG CTGCCAGGTT TTGCACCGGA GCTGCAGGGT GTCTATATCG AAGAACATGA TGGTCGTAAG CAAGTTTGGT GCTATGTCAT CAAGCCTCGT GATGCGCAAA GCACCCTGTT GAAAGGGGAA AAATTATGA
|
Protein sequence | MSKLKIAVSD SCPDCFTTQR ECIYINESRN IDVAAIVLSL NDVTCGKLDE IDATGYGIPV FIATENQERV PAEYLPRISG VFENCESRRE FYGRQLETAA SHYETQLRPP FFRALVDYVN QGNSAFDCPG HQGGEFFRRH PAGNQFVEYF GEALFRADLC NADVAMGDLL IHEGAPCIAQ QHAAKVFNAD KTYFVLNGTS SSNKVVLNAL LTPGDLVLFD RNNHKSNHHG ALLQAGATPV YLETARNPYG FIGGIDAHCF EESYLRELIA EVAPQRAKEA RPFRLAVIQL GTYDGTIYNA RQVVDKIGHL CDYILFDSAW VGYEQFIPMM ADCSPLLLDL NENDPGILVT QSVHKQQAGF SQTSQIHKKD SHIKGQQRYV PHKRMNNAFM MHASTSPFYP LFAALDINAK MHEGVSGRNM WMDCVVNGIN ARKLILDNCQ HIRPFVPELV DGKPWQSYET AQIAVDLRFF QFVPGEHWHS FEGYAENQYF VDPCKLLLTT PGIDARNGEY EAFGVPATIL ANFLGENGVV PEKCDLNSIL FLLTPAEDMA KLQQLVALLV RFEKLLESDA PLAEVLPSIY KQHEERYAGY TLRQLCQEMH DLYARHNVKQ LQKEMFRKEH FPRVSMNPQE ANYAYLRGEV ELVRLPDAEG RIAAEGALPY PPGVLCVVPG EIWGGAVLRY FSALEEGINL LPGFAPELQG VYIEEHDGRK QVWCYVIKPR DAQSTLLKGE KL
|
| |
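The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be cleaned before checking it against the GC content and translation table entries. It assumes the stop codons of translation table 11 (TAA, TAG, TGA); the helper names are hypothetical, and no result for the full sequence is claimed here.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence field, used only as an excerpt.
excerpt = clean_sequence("ATGTCAAAAT TAAAAATTGC")
print(len(excerpt))            # 20
print(excerpt.startswith("ATG"))  # True
```

Applied to the complete Gene sequence field, `round(gc_fraction(seq) * 100)` could be compared with the listed GC content, and `ends_with_stop(seq)` with the expectation that a complete CDS ends in a stop codon.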