Gene EcolC_3676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3676 
Symbol 
ID6065963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4025810 
End bp4027360 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content53% 
IMG OID641603091 
Producthypothetical protein 
Protein accessionYP_001726614 
Protein GI170021660 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0063204 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCACTT CTGATGAAAA TGCACTGCAA CAACGTTGCC AGCAAATTGT CACCAGCCCA 
GTGCTTAGCC CGGAGCAGAA GCGCCATTTT CTGGCACTTG AAGCAGAAAA CAATCTGCCT
TATCCGCAGC TGCCTGCCGA AGCCCGCCGT GCGCTGGATG AGGGTGTAAT CTGCGATATG
TTTGAAGGTC ATGCGCCGTA CAAACCGCGC TATGTCTTAC CCGATTACGC CCGTTTTCTG
GCGAACGGTT CCGAATGGCT GGAGCTGGAA GGCGCGAAAG ATCTTGATGA CGCACTCTCT
CTGCTGACCA TTCTTTACCA CCACGTACCG TCGGTCACAT CGATGCCGGT CTACCTGGGG
CAACTGGATG CGTTGTTGCA ACCGTATGTT AGAATTCTAA CACAAGACGA GATCGATATT
CGAATAAAAC GTTTCTGGCG TTACCTCGAC AGAACCCTGC CAGACGCCTT TATGCACGCC
AATATCGGCC CGTCTGATTC GCCCATTACC CGTGCAATCT TACGTGCAGA CGCAGAGTTG
AAGCAGGTTT CACCGAACCT GACCTTTATC TACGATCCTG AAATCACCCC TGATGACCTA
CTGCTGGAAG TGGCGAAGAA CATCTGTGAA TGTAGCAAAC CGCACATCGC CAACGGTCCG
GTGCATGATA AAATTTTCAC AAAAGGGGGC TACGGGATTG TGAGCTGTTA CAACTCACTG
CCGCTGGCGG GTGGTGGCAG CACGCTGGTA CGCCTTAACC TGAAAGCCAT TGCCGAGCGC
AGCGAATCGC TGGATGACTT CTTTACGCGC ACTCTACCGC ACTACTGCCA GCAGCAGATC
GCCATCATCG ATGCGCGGTG TGAATTCCTC TATCAACAAT CACACTTCTT TGAGAATAGC
TTCCTGGTGA AAGAAGGGCT GATTAACCCT GAACGTTTTG TGCCAATGTT TGGCATGTAT
GGGCTGGCGG AAGCGGTTAA CTTGCTGTGT GAGAAAGAAG GGATTGCCGC GCGCTACGGT
AAAGAAGCCG CCGCAAATGA AGTGGGTTAT CGCATCAGCG CGCAACTGGC GGAGTTTGTC
GCCAATACCC CCGTGAAATA TGGCTGGCAA AAACGCGCCA TGTTACACGC ACAGTCGGGG
ATCAGTTCCG ATATCGGCAC CACGCCGGGC GCGCGTTTAC CCTATGGCGA TGAGCCAGAT
CCGATCACCC ATCTGCAAAC TGTCGCCCCG CATCATGCTT ATTATTATTC CGGCATCAGC
GACATTCTGA CGCTCGACGA AACCATCAAA CGTAATCCGC AGGCGCTGGT ACAGCTTTGC
CTCGGTGCCT TTAAAGCCGG AATGCGTGAA TTTACCGCCA ATGTCAGCGG TAACGATCTG
GTTCGCGTTA CCGGTTATAT GGTGCGTTTG TCGGATTTAG AAAAATATCG CGCCGAAGGT
TCACGCACCA ACACCACCTG GCTGGGCGAA GAAGCCGCAC GCAACACTCG TATTCTGGAA
CGCCAGCCGC GCGTGATAAG CCATGAACAG CAGATGCGCT TTAGTCAGTA A
 
Protein sequence
MPTSDENALQ QRCQQIVTSP VLSPEQKRHF LALEAENNLP YPQLPAEARR ALDEGVICDM 
FEGHAPYKPR YVLPDYARFL ANGSEWLELE GAKDLDDALS LLTILYHHVP SVTSMPVYLG
QLDALLQPYV RILTQDEIDI RIKRFWRYLD RTLPDAFMHA NIGPSDSPIT RAILRADAEL
KQVSPNLTFI YDPEITPDDL LLEVAKNICE CSKPHIANGP VHDKIFTKGG YGIVSCYNSL
PLAGGGSTLV RLNLKAIAER SESLDDFFTR TLPHYCQQQI AIIDARCEFL YQQSHFFENS
FLVKEGLINP ERFVPMFGMY GLAEAVNLLC EKEGIAARYG KEAAANEVGY RISAQLAEFV
ANTPVKYGWQ KRAMLHAQSG ISSDIGTTPG ARLPYGDEPD PITHLQTVAP HHAYYYSGIS
DILTLDETIK RNPQALVQLC LGAFKAGMRE FTANVSGNDL VRVTGYMVRL SDLEKYRAEG
SRTNTTWLGE EAARNTRILE RQPRVISHEQ QMRFSQ