Gene EcHS_A1076 details


Gene Information

Locus tag: EcHS_A1076
Symbol:
ID: 5591601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1090393
End bp: 1091583
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 52%
IMG OID: 640920241
Product: hypothetical protein
Protein accession: YP_001457806
Protein GI: 157160488
COG category: [R] General function prediction only
COG ID: [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:
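
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length; both assumptions are consistent with the reported values, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1090393, 1091583)
print(length, protein_length_aa(length))  # 1191 396
```

Under these assumptions the result matches the Gene Length (1191 bp) and Protein Length (396 aa) fields.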


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.00716941
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG 
TGGGTCTTTT CCGGGGCCGT TGCCCGCATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC
GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA
ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTTCC
CGCCGTTTGC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCGAC
AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC
GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGGGCAG AATATCAGCG CGCGGCATTA
ATTAGTGCCC TGCAAACGCT GTACCCGGAA TGTTCGATTT ACGATCGCAG CGACGTCGCG
GTACGTAAAA AAGAAGGAAT GGAGCTGACC CAGGGCCCCG TCACCGGCGA GTTGCCACCT
GCCCTGCTGC CGATTGAAGA ACACGGAATG AAACTGCTGG TGGATATTCA GCACGGACAC
AAAACGGGCT ACTACCTGGA CCAGCGTGAT AGCCGCCTGG CTACCCGCCG CTACGTTGAA
AATAAACGTG TGCTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG
GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCGCTGGA TATTGCACGG
CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC
TTTAAATTGC TGCGTACTTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC
CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGGGG CTATAAAGAT
ATCAACATGC TGGCGATTCA GTTGCTGAAT GAAGGCGGTA TTCTCCTGAC TTTCTCCTGT
TCCGGTCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CGGATGCCGC AATTGATGCC
GGCCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCAG CCGATCATCC GGTGATCGCT
ACCTACCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
 
Protein sequence
MSVRLVLAKG REKSLLRRHP WVFSGAVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ 
IRARVWTFDP SESIDIAFFS RRLQQAQKWR DWLAQKDGLD SYRLIAGESD GLPGITIDRF
GNFLVLQLLS AGAEYQRAAL ISALQTLYPE CSIYDRSDVA VRKKEGMELT QGPVTGELPP
ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM
GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD
PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGILLTFSC SGLMTSDLFQ KIIADAAIDA
GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM
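
The gene and protein sequences above are related through translation table 11, and the GC content field is a property of the gene sequence. The following is a minimal sketch of how the 10-character blocks could be parsed, translated, and checked. It assumes the sequence is given 5' to 3' in the coding frame and contains only A, C, G and T. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any tool referenced by this record. The amino-acid string is the codon assignment for translation table 11, which is identical to the standard code; table 11 differs only in its permitted start codons, which this sketch does not handle specially.

```python
# Codon assignments for translation table 11, in T, C, A, G order
# for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG"
last_line = "ACCTACCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A"
print(translate(first_line))  # MSVRLVLAKG
print(translate(last_line))   # TYPEGLYLKGFACRVM
```

Each full line of the gene sequence holds 60 bases (20 codons), so every line starts in frame and can be translated on its own, as done here for the first 30 bases and the final line. Passing the complete gene sequence to `gc_fraction` would give a value to compare against the reported GC content.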