Gene EcHS_A4030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4030 
Symbol 
ID5591743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4022267 
End bp4023736 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content49% 
IMG OID640923134 
Productputative lipoprotein 
Protein accessionYP_001460600 
Protein GI157163282 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTATC GCAATGTGGT TTGTTTGTTG TCTTGTTCAT TGTTCTTATC ATCGGCATGG 
GGATGTCGGC TTGATGAGCC GGAACATAAT ATTTACCAGA AACAAGGAAA AGGCGTGGTG
TACCTGCGCC CTTATGAAAA GACCAATCTT TCGCTACCGC AGATTAACTA CAAGCGTCTG
CGTCTGTTAC CGAATTTGTT AATTGACCCA ACAAAACTGA AGGATTGGGA AACGGTGCCC
CCGGCTACTG ATCTCACGAC AGACGTTGTC TACAGCGGCG CAAACGCAAC TCTTCCCCAT
TACTCTTACT ATAGCGATGG TCGCGCTATT CTCTACGCTG GCGAGATTGT GCAAAACCCT
CCAGGCACAC CGCCAGTCGA TATCTCGTCA TTTCAGGCAT GGGGTGATTT TGCCGCAGAT
AAGTACAGCC TCTATTACGA AGGCAAACGC ACCGATAGCA ACCAGCAACT AAACCGCCGA
ACGTTGCGTC AGGTAGAATT TAACCCGCAA TGGAAACCAG ACTGGCTAGG TTTGATTCTC
CGTGACAAGC ATTATCTTTA CGCAAATGGT CAGCGCCTTG ATGATCCTGA CACCTTTACG
GTACTGGCAC AAAAATCATG GGATCAGCGC GGTAAATTCT CTACAGCATT CAATCCCTGC
CTTCCTGCCC CATTTGGCCC CTGGGATACC CTGGCTCGTA CACGGACCAA AATCCTGATC
AACAGCGAAC AGCTTGATGC CGACCCGAAC ACCTTTTCCG TCGTACGCTG GATGCCCGGC
TCACTCTTGA CCTGGCGTGA TAAAAACGGG CTACAGCGTA AAGTCCTCGA CAAGGAAAAT
CTGGCGTGGG ATGAAGATTT AACAAAGCAC TGTCTGGATT TTTCTCTGCT GGAAAAGAAA
GTGTTCTGGC GTAAAGGGCC TGCTTGTAAA CAGGAAGAAT TACCCGGACT CGATCCGGAA
CAGTTTCACC CCATCAGTGA TGCTGTCGCC CAGTATCAGG ACTCGCTTTA TACCATCATC
GAAACAGAGT CTGGTGACCG CAAGCTGGAG ATCGTGAAAC TTGATGATCC CAATCTTATT
ATCAACAAAC GTTTCAACGC CGGGAAACGC CACGGCTATT TACTTACGCG TGCCGAAGGG
TGGCCATACC ATTCCGGTTT ACACGTGTTT GAATCTGACG GACCGCTGAT CTTACTGGAT
AACCACTCTC CGGATGAACG CGAAGCCCAT CTTAATGACC ATCCCTTTTT GCGCAGATGG
TATGCCCGCG ATAACCGCTA CGTTTACAGC TTTGATGGCG CGCAGCTCTG GCGATACCGC
ACCGCTGATC CGAAACAAGT TCGCTTAATC TGGAAGGAAC AACATTCGGG ATATGGCTAT
GGCGTAAATT ACAAAACGGG ATATCTGGAC GGAAAAATTA CTGATGACGG CGAATTTATT
CCTGCCCCGC GCAATGAGGC GACAAAATGA
 
Protein sequence
MHYRNVVCLL SCSLFLSSAW GCRLDEPEHN IYQKQGKGVV YLRPYEKTNL SLPQINYKRL 
RLLPNLLIDP TKLKDWETVP PATDLTTDVV YSGANATLPH YSYYSDGRAI LYAGEIVQNP
PGTPPVDISS FQAWGDFAAD KYSLYYEGKR TDSNQQLNRR TLRQVEFNPQ WKPDWLGLIL
RDKHYLYANG QRLDDPDTFT VLAQKSWDQR GKFSTAFNPC LPAPFGPWDT LARTRTKILI
NSEQLDADPN TFSVVRWMPG SLLTWRDKNG LQRKVLDKEN LAWDEDLTKH CLDFSLLEKK
VFWRKGPACK QEELPGLDPE QFHPISDAVA QYQDSLYTII ETESGDRKLE IVKLDDPNLI
INKRFNAGKR HGYLLTRAEG WPYHSGLHVF ESDGPLILLD NHSPDEREAH LNDHPFLRRW
YARDNRYVYS FDGAQLWRYR TADPKQVRLI WKEQHSGYGY GVNYKTGYLD GKITDDGEFI
PAPRNEATK