Gene EcolC_3084 details


Gene Information

Locus tag: EcolC_3084
Symbol:
ID: 6066207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3376741
End bp: 3379713
Gene Length: 2973 bp
Protein Length: 990 aa
Translation table: 11
GC content: 55%
IMG OID: 641602500
Product: bacteriophage N4 receptor, outer membrane subunit
Protein accession: YP_001726035
Protein GI: 170021081
COG category:
COG ID:
TIGRFAM ID:
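
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the record itself does not state either convention. The function and constant names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3376741
END_BP = 3379713
GENE_LENGTH_BP = 2973
PROTEIN_LENGTH_AA = 990


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from Start bp to End bp matches the listed gene length, and the codon count matches the listed protein length.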


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA 
TTGAGTACCA GCGCACTCGC AGACAATATC GGCACCAGCG CAGAAGAGCT GGGGCTGAGC
GATTATCGCC ATTTTGTTAT TTATCCCCGT CTCGATAAGG CGCTGAAGGC ACAGAAAAAT
AACGACGAAG CAACCGCCAT CCGCGAATTT GAATATATAC ACCAGCAGGT GCCGGATAAT
ATTCCGCTGA CTTTATACCT TGCGGAAGCC TATCGCCATT TTGGTCATGA TGACCGGGCG
CGGCTGTTGC TGGAGGATCA ACTGAAACGT CACCCAGGAG ATGCCCGACT TGAGCGCAGT
CTGGCGGCTA TTCCAGTTGA AGTGAAAAGC GTTACGACTG TTGAAGAACT GCTTGCCCAG
CAAAAAGCGT GCGATGCTGC GCCGACCCTG CGTTGTCGCA GTGAAGTCGG GCAGAATGCC
CTGCGGCTGG CACAGTTACC TGTCGCCAGA GCGCAACTGA ACGATGCGAC GTTTGCTGCA
TCGCCGGAAG GAAAAACGCT GCGAACCGAT CTGCTGCAAC GGGCAATCTA CCTGAAACAA
TGGTCCCAGG CAGATACGCT ATACAATGAA GCACGCCAGC AGAACACATT AAGCGCGGCA
GAACGCCGTC AGTGGTTTGA CGTGCTTCTT GCCGGGCAGC TGGACGATCG GATCCAGGCA
CTGCAATCAC AGGGGATCTT CACCGATCCT CAGTCATATA TTACTTACGC GACCGCGCTG
GCTTATCGTG GCGAAAAAGC ACGCCTCCAG CATTATCTCA TTGAAAATAA GCCGCTGTTT
ACCACGGACG CACAAGAGAA AAGTTGGCTC TATCTGTTAT CTAAATACAG CGCCAACCCC
GTTCAGGCAT TGGCGAATTA TACGGTGCAG TTTGCCGATA ACCGCCAGTA TGTTGTTGGC
GCGACGCTAC CGGTGCTGTT AAAAGAAGGT CAGTACGACG CAGCGCAAAA ACTGCTCGCC
ACCCTCCCCG CCAATGAAAT GCTTGAGGAG CGTTATGCTG TCAGCGTGGC GACCCGTAAC
AAGGCTGAAT CTCTGCGTCT GGCACGATTG CTGTATCAGC AAGAACCGGC AAATCTTACC
CGCCTGGATC AACTGACCTG GCAACTGATG CAGAACGAGC AGTCACGCGA AGCTGCCGAT
TTATTGCTGC AACGCTATCC TTTCCAGGGC GATGCGCGTG TCAGCCAGAC TTTAATGGCG
CGACTGGCGT CTCTGCTGGA AAGTCATCCT TACCTGGCAA CTCCAGCGAA GGTGGCGATT
TTATCAAAAC CCTTGCCGCT GGCGGCGCAA CGTCAGTGGC AAAGCCAGTT GCCGGGTATT
GCAGATAATT GCCCGGCAAT AGTTCGCTTG CTGGGCGATA TGTCGCCTTC CTACGATGCC
GCCGCCTGGA ACCGTCTGGC AAAGTGTTAT CGGGACACGC TACCCGGTGT GGCCTTGTAT
GCATGGCTTC AGGCCGAACA ACGACAACCG AACGCCTGGC AACATCGTGC GGTAGCCTAT
CAGGCGTATC AGGTTGAGGA CTACGCCACC GCACTGGCGG CCTGGCAGAA AATCAGTCTT
CACGACATGA GCAATGAGGA TCTGCTTGCT GCTGCCAATA CCGCCCAGGC GGCAGGAAAT
GGTGCAGCTC GCGATCGCTG GCTACAACAG GCAGAACAAC GTGGACTGGG AAACAATGCC
CTCTACTGGT GGCTGCATGC GCAACGTTAC ATTCCTGGTC AGCCGGAACT CGCACTGAGC
GATCTCACGC GCTCAATCAA TATTGCGCCT TCTGCCAACG CTTACGTTGC GCGGGCGACA
ATTTATCGCC AACGTCATAA TGTCCCGGCG GCGGTGAGTG ATTTGCGCGC CGCGCTGGAA
CTGGAACCGA ATAATAGCAA CATCCAGGCA GCGCTTGGTT ACGCCTTGTG GGATAGCGGT
GATATCGCAC AGTCGCGGGA AATGCTCGAA CAGGCGCATA AAGGGCTACC GGACGATCCG
GCACTGATCC GACAACTGGC CTATGTGAAC CAGCGTCTGG ATGACATGCC TGCGACGCAG
CACTACGCCC GCCTGGTGAT TGATGACATT GATAATCAGG CGCTGATAAA CCCACTCACC
CCAGAGCAAA ATCAGCAACG CTTCAATTTC CGCCGTCTGC ATGAGGAGGT CGGTCGCCGC
TGGACGTTCA GTTTCGATTC TTCTATCGGC TTGCGTTCCG GGGCAATGAG CACCGCTAAC
AATAATGTCG GCGGCGCAGC GCCAGGGAAA AGCTATCGTA GCTACGGGCA ACTGGAAGCC
GAGTACCGCA TCGGACGCAA TATGCTGCTG GAAGGCGACC TGCTCTCAGT TTATAGCCGC
GTCTTTGCCG ATACCGGAGA AAACGGGGTG ATGATGCCGG TGAAAAATCC GATGTCCGGC
ACCGGTCTAC GCTGGAAGCC GCTGCGCGAT CAGATCTTTT TCCTCGCCGT CGAACAGCAG
TTGCCGCTGA ACGGGCAAAA TGGCGCATCC GATACCATGC TGCGCGCCAG CGCCTCATTC
TTTAATGGCG GCAAATACAG CGACGAATGG CACCCGAACG GTTCAGGCTG GTTTGCCCAA
AACCTGTACC TCGATGCGGC GCAATATATC CGCCAGGATA TTCAGGCGTG GACGGCAGAT
TATCGCGTCA GCTGGCATCA GAAGGTAGCT AACGGACAGA CTATTGAGCC TTACGCTCAC
GTTCAGGACA ACGGCTATCG TGATAAAGGC ACTCAGGGCG CGCAGCTTGG CGGTGTCGGG
GTCCGCTGGA ATATCTGGAC CGGCGAGACG CACTACGACG CCTGGCCGCA CAAAGTCAGT
CTCGGCGTCG AGTATCAACA TACCTTTAAG GCGATTAATC AACGTAACGG AGAGCGCAAC
AACGCGTTTC TCACCATTGG AGTGCACTGG TAA
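
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be joined before anything is computed from it. The sketch below shows one way to strip the layout and work out a GC percentage comparable to the "GC content" field. The helper names are hypothetical, and only the first row of the listing is used as sample input, so the printed value is not the whole-gene figure.

```python
def clean_sequence(listing: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence listing above, used as sample input.
first_row = "ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` and then to `gc_percent` would give the value for the whole gene.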
 
Protein sequence
MKENNLNRVI GWSGLLLTSL LSTSALADNI GTSAEELGLS DYRHFVIYPR LDKALKAQKN 
NDEATAIREF EYIHQQVPDN IPLTLYLAEA YRHFGHDDRA RLLLEDQLKR HPGDARLERS
LAAIPVEVKS VTTVEELLAQ QKACDAAPTL RCRSEVGQNA LRLAQLPVAR AQLNDATFAA
SPEGKTLRTD LLQRAIYLKQ WSQADTLYNE ARQQNTLSAA ERRQWFDVLL AGQLDDRIQA
LQSQGIFTDP QSYITYATAL AYRGEKARLQ HYLIENKPLF TTDAQEKSWL YLLSKYSANP
VQALANYTVQ FADNRQYVVG ATLPVLLKEG QYDAAQKLLA TLPANEMLEE RYAVSVATRN
KAESLRLARL LYQQEPANLT RLDQLTWQLM QNEQSREAAD LLLQRYPFQG DARVSQTLMA
RLASLLESHP YLATPAKVAI LSKPLPLAAQ RQWQSQLPGI ADNCPAIVRL LGDMSPSYDA
AAWNRLAKCY RDTLPGVALY AWLQAEQRQP NAWQHRAVAY QAYQVEDYAT ALAAWQKISL
HDMSNEDLLA AANTAQAAGN GAARDRWLQQ AEQRGLGNNA LYWWLHAQRY IPGQPELALS
DLTRSINIAP SANAYVARAT IYRQRHNVPA AVSDLRAALE LEPNNSNIQA ALGYALWDSG
DIAQSREMLE QAHKGLPDDP ALIRQLAYVN QRLDDMPATQ HYARLVIDDI DNQALINPLT
PEQNQQRFNF RRLHEEVGRR WTFSFDSSIG LRSGAMSTAN NNVGGAAPGK SYRSYGQLEA
EYRIGRNMLL EGDLLSVYSR VFADTGENGV MMPVKNPMSG TGLRWKPLRD QIFFLAVEQQ
LPLNGQNGAS DTMLRASASF FNGGKYSDEW HPNGSGWFAQ NLYLDAAQYI RQDIQAWTAD
YRVSWHQKVA NGQTIEPYAH VQDNGYRDKG TQGAQLGGVG VRWNIWTGET HYDAWPHKVS
LGVEYQHTFK AINQRNGERN NAFLTIGVHW
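
The protein sequence can be related back to the gene sequence by translating codons. The following is a minimal sketch, not the database's own method. It builds the codon table from the usual T, C, A, G ordering and uses the standard amino-acid assignments, which translation table 11 shares with the standard code for elongation codons. Table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG. All names are illustrative.

```python
from itertools import product

# Codons in TCAG order, paired with their standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(listing: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(listing.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAGGAGA ATAACCTTAA TCGCGTCATC"))  # MKENNLNRVI
```

The output matches the first block of the protein sequence listing.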