Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3901 |
Symbol | |
ID | 5592376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3895233 |
End bp | 3896447 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640923009 |
Product | hypothetical protein |
Protein accession | YP_001460486 |
Protein GI | 157163168 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 80 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA TAACCTTTGC TCCCCGTAAT CACCTGCTCA CCAATACCAA TACCTGGACG CCCGACAGCC AGTGGCTGGT ATTTGACGTG CGTCCTTCTG GCGCGTCGTT TACCGGCGAG ACCATTGAGC GTGTGAATAT CCATACCGGC GAGGTCGAGG TTATCTATCG CGCGTCACAG GGCGCACACG TCGGCGTGGT GACCGTTCAT CCAAAGTCAG AGAAATATGT CTTTATTCAC GGCCCGGAAA ATCCTGATGA AACATGGTAT TACGATTTCC ATCACCGTCG CGGCGTGATT GCTGAAAGCG GCAAGGTGAG CAATCTCGAT GCAATGGATA TTACTGCACC GTACACCCCA GGAGCGCTGC GCGGCGGCAG CCATGTGCAT GTCTTTAGCC CGAACGGTGA AAGGGTGAGC TTTACCTATA ACGACCATGT AATGCAAGAA CTCGATCCGG CGCTGGATTT GCGAAACGTC GGTGTTGCTG CGCCGTTTGG CCCGGTCAAC GTACAAAAGC AGCATCCGCG TGAATACAGC GGTAGCCACT GGTGCGTGCT GGTGAGCAAA ACCACGCCCA CGCCACAGCC TGGCAGTGAT GAAATCAATC GTGCTTATGA AGAAGGATGG GTAGGAAATC ACGCGCTGGC ATTTATTGGC GACACACTTT CGCCAAAGGG CGAGAAAGTG CCGGAGCTGT TTATCGTTGA GTTACCGCAA GATGAAGCTG GCTGGAAAGC GGCAGGTGAT GCGCCGTTAA GCGGAACGGA AACAACCCTG CCCGCGCCAC CGCGTGGCGT CGTGCAGCGA CGTTTAACCT TTACCCACCA TCGGGCTTAT CCGGGGTTAG TCAACGTCCC GCGCCACTGG GTGCGCTGTA ATCCGCAGGG TACGCAAATC GCGTTTTTAA TGCGTGATGA TAACGGCATT GTGCAACTGT GGCTTATCTC GCCACAGGGC GGCGAGCCGC GCCAGTTAAC CCATAACAAA ACGGATATTC AGTCTGCATT TAACTGGCAT CCGTCAGGAG AATGGTTGGG CTTTGTGCTG GATAATCGAA TTGCTTGTGC CCATGCGCAA AGTGGCGAGG TTGAGTATTT AACCGAAAAC CACGCCAATC CGCCTTCTGC GGATGCCGTG GTCTTCTCAC CGGATGGTCA ATGGCTGGCG TGGATGGAAG GTGGCCAGCT GTGGATCACC GAAACTGATC GCTAA
|
Protein sequence | MKQITFAPRN HLLTNTNTWT PDSQWLVFDV RPSGASFTGE TIERVNIHTG EVEVIYRASQ GAHVGVVTVH PKSEKYVFIH GPENPDETWY YDFHHRRGVI AESGKVSNLD AMDITAPYTP GALRGGSHVH VFSPNGERVS FTYNDHVMQE LDPALDLRNV GVAAPFGPVN VQKQHPREYS GSHWCVLVSK TTPTPQPGSD EINRAYEEGW VGNHALAFIG DTLSPKGEKV PELFIVELPQ DEAGWKAAGD APLSGTETTL PAPPRGVVQR RLTFTHHRAY PGLVNVPRHW VRCNPQGTQI AFLMRDDNGI VQLWLISPQG GEPRQLTHNK TDIQSAFNWH PSGEWLGFVL DNRIACAHAQ SGEVEYLTEN HANPPSADAV VFSPDGQWLA WMEGGQLWIT ETDR
|
| |