Gene EcHS_A3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3901 
Symbol 
ID5592376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3895233 
End bp3896447 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content55% 
IMG OID640923009 
Producthypothetical protein 
Protein accessionYP_001460486 
Protein GI157163168 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones80 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGA TAACCTTTGC TCCCCGTAAT CACCTGCTCA CCAATACCAA TACCTGGACG 
CCCGACAGCC AGTGGCTGGT ATTTGACGTG CGTCCTTCTG GCGCGTCGTT TACCGGCGAG
ACCATTGAGC GTGTGAATAT CCATACCGGC GAGGTCGAGG TTATCTATCG CGCGTCACAG
GGCGCACACG TCGGCGTGGT GACCGTTCAT CCAAAGTCAG AGAAATATGT CTTTATTCAC
GGCCCGGAAA ATCCTGATGA AACATGGTAT TACGATTTCC ATCACCGTCG CGGCGTGATT
GCTGAAAGCG GCAAGGTGAG CAATCTCGAT GCAATGGATA TTACTGCACC GTACACCCCA
GGAGCGCTGC GCGGCGGCAG CCATGTGCAT GTCTTTAGCC CGAACGGTGA AAGGGTGAGC
TTTACCTATA ACGACCATGT AATGCAAGAA CTCGATCCGG CGCTGGATTT GCGAAACGTC
GGTGTTGCTG CGCCGTTTGG CCCGGTCAAC GTACAAAAGC AGCATCCGCG TGAATACAGC
GGTAGCCACT GGTGCGTGCT GGTGAGCAAA ACCACGCCCA CGCCACAGCC TGGCAGTGAT
GAAATCAATC GTGCTTATGA AGAAGGATGG GTAGGAAATC ACGCGCTGGC ATTTATTGGC
GACACACTTT CGCCAAAGGG CGAGAAAGTG CCGGAGCTGT TTATCGTTGA GTTACCGCAA
GATGAAGCTG GCTGGAAAGC GGCAGGTGAT GCGCCGTTAA GCGGAACGGA AACAACCCTG
CCCGCGCCAC CGCGTGGCGT CGTGCAGCGA CGTTTAACCT TTACCCACCA TCGGGCTTAT
CCGGGGTTAG TCAACGTCCC GCGCCACTGG GTGCGCTGTA ATCCGCAGGG TACGCAAATC
GCGTTTTTAA TGCGTGATGA TAACGGCATT GTGCAACTGT GGCTTATCTC GCCACAGGGC
GGCGAGCCGC GCCAGTTAAC CCATAACAAA ACGGATATTC AGTCTGCATT TAACTGGCAT
CCGTCAGGAG AATGGTTGGG CTTTGTGCTG GATAATCGAA TTGCTTGTGC CCATGCGCAA
AGTGGCGAGG TTGAGTATTT AACCGAAAAC CACGCCAATC CGCCTTCTGC GGATGCCGTG
GTCTTCTCAC CGGATGGTCA ATGGCTGGCG TGGATGGAAG GTGGCCAGCT GTGGATCACC
GAAACTGATC GCTAA
 
Protein sequence
MKQITFAPRN HLLTNTNTWT PDSQWLVFDV RPSGASFTGE TIERVNIHTG EVEVIYRASQ 
GAHVGVVTVH PKSEKYVFIH GPENPDETWY YDFHHRRGVI AESGKVSNLD AMDITAPYTP
GALRGGSHVH VFSPNGERVS FTYNDHVMQE LDPALDLRNV GVAAPFGPVN VQKQHPREYS
GSHWCVLVSK TTPTPQPGSD EINRAYEEGW VGNHALAFIG DTLSPKGEKV PELFIVELPQ
DEAGWKAAGD APLSGTETTL PAPPRGVVQR RLTFTHHRAY PGLVNVPRHW VRCNPQGTQI
AFLMRDDNGI VQLWLISPQG GEPRQLTHNK TDIQSAFNWH PSGEWLGFVL DNRIACAHAQ
SGEVEYLTEN HANPPSADAV VFSPDGQWLA WMEGGQLWIT ETDR