Gene EcE24377A_F0002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_F0002 
Symbol 
ID5585660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009786 
Strand
Start bp1352 
End bp3262 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content63% 
IMG OID640913725 
Producthypothetical protein 
Protein accessionYP_001451375 
Protein GI157149386 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000524095 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCATA AAACAGACAC AGCCCCTGTA CAGGAGCAGG CAGGTCTGAC GTTTCGTCTG 
GAGACCTTTG AATGGCAGGT GCACCAGGGG CTTAACGAAG AGGCGGCCCG GTCCCTGATA
TCGCTCTTAC AGTTGCTGGA CCGACATTAT GCGCAGTGGG GGGAGAGCTT TTCCGCCTGG
GCGCCGGGGA TGACGGCAGA GGAGATAAAT CCCCATCTGT GCACCCGTAT TGCCGGGGCC
ATCACGGCGC TGTTCTCCCG TCCGGGGTTC CGGGTCAGCG ACGGCGGTTT TGCGGAGCTG
ATGGACTATC ACCGCTGGCT GGCCATTATT TTTGCCGTCA GCGACTACCG CCACGGCGAC
CATATCATCC GCAACATCAA CGCGGCCGGG GGCGGGGTGG TTGCCCCCCT GACCCTGAAC
GCGGATAATC TGCAGCTGTT CTGCCTGAGT TATTACCCGG ATTCACAGAT AGCCCTGCAG
CCGGAGCCGC TCTGGCAGTA TGACCGACAG ACGGTGGTCC GGCTGTTCTT TGCCCTGCTG
AGCGGTCGCG CCCTGCCGAC GCCGGCGGCG CACCAGAAGC GCGAGCATCT CCTGGCGTGG
CTGCCGGAGA GGCTGAAGGA GATTGATTCT CTGGAGTTTC TGCCCGGGAA GGTGCTGCAC
GATGTTTACA TGCACTGCTC CTATGCAGAT TTACCGGAAA AGCACCGCAT CAAGCAGGAA
ATCAACCGGC TGACGGCCCG GGCACTGGAG CAGACTTACG CAGACTGTCT GCCGGTACGC
GCGCCGGAAG CGGCGCGTCA GAAACCGGTG CTGGCGGTGG TGCTGGAGTG GTTTACCTGT
CAGCACAGCA TTTACCGGAC CCACTCCACC TCCATGCGCG CCCTGCGGGA GCACTTCCAC
CTGCTGGGTA TTGCGCAGCC CGGAGCGACG GACGAGATTA CCCGGGAGGT GTTTGATGAG
TTCCGGGAGC TGTCGGCGGA GAACGTTGTC GGGGATGCCA TCCGCTGCCT GAGTGAGGTG
CGCCCGGACG TGATTTACTA CCCGTCCGTG GGCATGTTCC CGCTGACCGT CTACCTGACG
GCCCTGCGCC TGGCTCCGTT GCAGCTGATG GCGCTGGGAC ACCCGGCCAC CACCTGGTCT
GAGCATATTG ATGGTGTCCT GGTGGAGGAA GACTACCTGG GAGACCCGGC ATGCTTCAGC
GAGACGGTCT GTGCCGTCCC GAAGGATGCG ATACCGTATA TTCCGCCGGC CAGCACGGAA
CGTGTCCTGC CGGAACGCAC ACCATTCCGT GACCGGGCGA AGGCGGCGTG GCCTGCGGCC
CTGCCGGTGC GGGTGGCTGT CTGTGCATCG GTCATGAAAA TCAACCCGGG CTTCCTGGAT
ACCCTGCGGG AAATCAGCGA CAGAAGCCGG GTGCCGGTTC AGTTCTGCTT CTGGATGGGC
TTTGCTCAGG GGCTGACGCT GGACTACCTG CGCCGGGCTA TCCGTCAGGC GCTGCCGACG
GCAGAAGTGA ATGCGCACAT GCCAGTCCAG GCATACCAGC AGGCGCTGAA CAGCTGTGAG
CTGTTTGTGA ACCCGTTCCC GTTTGGCAAC ACCAACGGCC TGGTGGATAC CGTGCGCCAG
GGGCTGCCCG GGGTGTGCAT GACGGGGCCG GAAGTCCACA CCCATATTGA TGAGGGGCTG
TTCAGACGCC TGGGCCTGCC GGAGGCCCTG ATTGCCCGCG ACCGCGAGGA GTACATCACG
GCGGTACTGT CCCTGACGGA GACGCCACGC CTGCGCGAGC GTCTGCAGAA ATACCTGACG
GAAAACGACG TGGAGAAGGT GCTGTTTGAA GGGCGTCCGG ATAAATTCGC GGAAAGGGTA
TGGCAGTTGT GGGAGGCGCG CAGCCATCGT CAGGAGGAGG GTGCCGAATG A
 
Protein sequence
MSHKTDTAPV QEQAGLTFRL ETFEWQVHQG LNEEAARSLI SLLQLLDRHY AQWGESFSAW 
APGMTAEEIN PHLCTRIAGA ITALFSRPGF RVSDGGFAEL MDYHRWLAII FAVSDYRHGD
HIIRNINAAG GGVVAPLTLN ADNLQLFCLS YYPDSQIALQ PEPLWQYDRQ TVVRLFFALL
SGRALPTPAA HQKREHLLAW LPERLKEIDS LEFLPGKVLH DVYMHCSYAD LPEKHRIKQE
INRLTARALE QTYADCLPVR APEAARQKPV LAVVLEWFTC QHSIYRTHST SMRALREHFH
LLGIAQPGAT DEITREVFDE FRELSAENVV GDAIRCLSEV RPDVIYYPSV GMFPLTVYLT
ALRLAPLQLM ALGHPATTWS EHIDGVLVEE DYLGDPACFS ETVCAVPKDA IPYIPPASTE
RVLPERTPFR DRAKAAWPAA LPVRVAVCAS VMKINPGFLD TLREISDRSR VPVQFCFWMG
FAQGLTLDYL RRAIRQALPT AEVNAHMPVQ AYQQALNSCE LFVNPFPFGN TNGLVDTVRQ
GLPGVCMTGP EVHTHIDEGL FRRLGLPEAL IARDREEYIT AVLSLTETPR LRERLQKYLT
ENDVEKVLFE GRPDKFAERV WQLWEARSHR QEEGAE