Gene Information |
Locus tag | EcHS_A4278 |
Symbol | |
ID | 5593811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4282732 |
End bp | 4284027 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640923380 |
Product | hypothetical protein |
Protein accession | YP_001460825 |
Protein GI | 157163507 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
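The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. Both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4282732, 4284027)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1296 431
```

Both results match the Gene Length and Protein Length fields reported for EcHS_A4278.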
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 0.848192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAG ATGCGCAGGG GTATATCGAC CTGTCTGATT TGGATTTAAC AAGTTGTCAT TTTAAAGGTG ACGTTATATC GAAGGTGTCT TTTTTATCAT CAAATCTACA ACATGTAACA TTCGAATGTA AAGAAATTGG GGATTGCAAT TTTACTACTG CAATAGTTGA TAATGTCATA TTTAGATGTC GACGTTTACA CAATGTGATT TTTATCAAAG CGAGTGGTGA ATGTGTCGAT TTCAGCAAAA ATATTCTTGA TACAGTTGAC TTCTCGCAGA GTCAACTTAC TCAGAGTAAT TTTCGCGAAT ATCAGATTAG AAATTCAAAC TTCGATAATT GTTATCTTTA CGCTTCGCAC TTCACCAGAG CAGAGTTTCT GTCTGCCAAA GAAATATCAT TTATTAAATC GAATATGACA GCTGTTATGT TTGATCATGT GCGAATATCG ACAGGGAATT TTAAAGATTG CATTACAGAA CAATTGGAAT TAACTATTGA TTATTCAGAT ATATTTGGGA ATGAAGATCT CGATGGTTAT ATCAATAACA TTATAAAAAT GATTGATACA TTGCCAGATA ATGCAATGAT ATTGAAATCC GTTCTGGCAG TAAAACTGGT TATGCAATTA AAAATTCTTA ATATTGTTAA TAAAAACTTT ATTGAGAATA TGAAGAAAAC ATTTAGCCAT TGTCCTTATA TAAAAGATCC AATTATACGC AGTTATATCC ATTCTGGTGA AGATAACAAG TTCGATGATT TTATGCGTCA ACATCGATTC AGCAAGGTGG ATTTCGATAC CCAACAGATG ATCCATTTTA TTAACAGGTT TAATATGAAT AAAGGGCTGA TTGATAAAAA TAACAATTTT TTTATCCAAC TTATCGATCA GGCCTTACGA TCAACGGATG ATATGATCAA AGCAAATGCC TGGTATCTTT ATAAAGAGTG GATTCGTAGT GATGATGTTT CACCTCTATT TATAGAAATT GAAGATAATT TAAGAACCTT TAACACGAAT GAATTAACAC GAAAGGATAA TATCTTTATC CTGTTCTCCT CTGCCGATGA TGGGCCAGTT ATGGTGGTAA GCTCCCAGCG CTTACATGAT ATGTTGAATC CTACAAAAGA TACCAATTGG AATTCCACGT GTATCTATAA ATCCAGACAT AAGATGTTGC CTATTAATCT TACTCAGGAA ACACTTTTTA GCTCCAAATC TCATGGTAAA TATGCGCTTT TCCCAATTTT TACTGCGAGT TGGCGAGCTA CTCGTATAAA GAATATAGGT ATTTAA
Protein sequence | MDKDAQGYID LSDLDLTSCH FKGDVISKVS FLSSNLQHVT FECKEIGDCN FTTAIVDNVI FRCRRLHNVI FIKASGECVD FSKNILDTVD FSQSQLTQSN FREYQIRNSN FDNCYLYASH FTRAEFLSAK EISFIKSNMT AVMFDHVRIS TGNFKDCITE QLELTIDYSD IFGNEDLDGY INNIIKMIDT LPDNAMILKS VLAVKLVMQL KILNIVNKNF IENMKKTFSH CPYIKDPIIR SYIHSGEDNK FDDFMRQHRF SKVDFDTQQM IHFINRFNMN KGLIDKNNNF FIQLIDQALR STDDMIKANA WYLYKEWIRS DDVSPLFIEI EDNLRTFNTN ELTRKDNIFI LFSSADDGPV MVVSSQRLHD MLNPTKDTNW NSTCIYKSRH KMLPINLTQE TLFSSKSHGK YALFPIFTAS WRATRIKNIG I
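The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how one might strip that display formatting and recompute a GC percentage from a nucleotide string. The helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove whitespace from a sequence shown in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used only as a short demo input.
demo = clean_sequence("ATGGATAAAG ATGCGCAGGG")
print(len(demo), gc_percent(demo))  # 20 50.0
```

Passing the full Gene sequence field through the same two functions gives a value that can be compared with the reported GC content.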