Gene EcolC_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1984 
Symbol 
ID6068177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2192542 
End bp2194554 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content53% 
IMG OID641601398 
Productfusaric acid resistance protein region 
Protein accessionYP_001724957 
Protein GI170020003 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0152424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000622676 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGCAT CGTCATGGTC CTTGCGCAAT TTGCCCTGGT TCAGGGCCAC GCTGGCGCAA 
TGGCGTTATG CGTTACGCAA TACCATTGCC ATGTGTCTGG CGCTGACGGT TGCCTATTAT
TTAAATCTGG ATGAACCCTA TTGGGCGATG ACCTCGGCTG CAGTGGTTAG CTTTCCCACC
GTTGGCGGTG TTATCAGCAA AAGCCTCGGA CGCATCGCTG GCAGTTTGCT CGGAGCCATT
GCGGCACTGC TTCTTGCCGG GCATACGCTC AATGAGCCGT GGTTTTTTCT ATTGAGCATG
TCGGCGTGGC TTGGCTTTTG TACCTGGGCC TGTGCGCACT TCACGAATAA CGTCGCGTAT
GCATTTCAAC TGGCGGGCTA CACGGCTGCC ATCATCGCCT TTCCGATGGT TAATATTACT
GAGGCCAGCC AGCTGTGGGA TATCGCTCAG GCGCGCGTTT GCGAGGTGAT TGTCGGCATT
TTGTGCGGCG GCATGATGAT GATGATCCTG CCTAGCAGTT CCGATGCTAC AGCCCTTTTA
ACCGCATTGA AAAACATGCA CGCCCGACTA CTTGAACATG CCAGTTTACT CTGGCAGCCT
GAAACAACCG ATGCCATTCG TGCAGCACAT GAAGGGGTGA TTGGGCAGAT ACTGACCATG
AATTTGCTGC GTATCCAGGC TTTCTGGAGC CACTATCGTT TTCGCCAGCA AAACGCGCGC
CTTAATGCGC TGCTCCACCA GCAATTACGT ATGACCAGTG TCATCTCCAG CCTGCGACGT
ATGTTGCTCA ACTGGCCCTC ACCGCCAGGT GCCACACGAG AAATTCTCGA ACAGTTGCTG
ACGGCGCTCG CCAGTTCGCA AACAGATGTT TACACCGTCG CACGTATTAT CGCCTCGCTA
CGCCCGACCA ACGTCGCCGA CTATCGGCAC GTCGCCTTCT GGCAGCGACT ACGTTATTTT
TGCCGCCTTT ATCTGCAAAG TAGTCAGGAA TTACATCGTC TGCAAAGCGG TGTAGATGAT
CATACCAGAC TCCCACGGAC ATCCGGCCTG GCTCGTCATA CCGATAACGC CGAAGCTATG
TGGAGCGGGC TGCGTACATT TTGTACGTTG ATGATGATTG GCGCATGGAG TATTGCTTCG
CAATGGGATG CCGGTGCCAA TGCATTAACG CTGGCAGCAA TTAGCTGCGT ACTCTACTCC
GCCGTCGCAG CACCGTTTAA GTCGTTGTCA CTTCTGATGC GCACGCTGGT GTTACTTTCG
CTATTCAGCT TTGTGGTCAA ATTTGGTCTG ATGGTCCAGA TTAGCGATCT GTGGCAATTT
TTACTGTTTC TCTTTCCACT GCTGGCGACA ATGCAGCTTC TTAAATTGCA GATGCCAAAA
TTTGCCGCAT TGTGGGGGCA ACTGATTGTT TTTATGGGTT CTTTTATCGC TGTCACTAAT
CCCCCGGTGT ATGATTTTGC TGATTTTCTT AACGATAATC TGGCAAAAAT CGTTGGCGTC
GCGTTGGCGT GGTTAGCGTT CGCCATTCTG CGTCCAGGAT CGGATGCTCG TAAAAGCCGC
CGCCATATTC GCGCGCTGCG CCGGGATTTT GTCGATCAGC TAAGCCGCCA TCCAACACTG
AGTGAAAGCG AATTTGAATC GCTCACTTAT CATCACGTCA GTCAGTTGAG TAACAGCCAG
GATGCGCTGG CTCGCCGTTG GTTATTACGC TGGGGTGTAG TGCTGCTGAA CTGTTCTCAT
GTTGTCTGGC AATTGCGCGA CTGGGAATCG CGTTCCGATC CGTTATCGCG AGTACGGGAT
AACTGTATTT CACTGTTGCG GGGAGTGATG AGTGAGCGTG GCGTTCAGCA AAAATCACTG
GCGGCCACAC TTGAAGAATT ACAGCGGATT TGCGACAGCC TTGCCCGTCA TCATCAACCT
GCCGCCCGTG AGCTGGCGGC AATTGTCTGG CGGCTGTACT GCTCGCTTTC GCAACTTGAG
CAAGCACCAC CGCAAGGTAC GCTGGCCTCT TAA
 
Protein sequence
MNASSWSLRN LPWFRATLAQ WRYALRNTIA MCLALTVAYY LNLDEPYWAM TSAAVVSFPT 
VGGVISKSLG RIAGSLLGAI AALLLAGHTL NEPWFFLLSM SAWLGFCTWA CAHFTNNVAY
AFQLAGYTAA IIAFPMVNIT EASQLWDIAQ ARVCEVIVGI LCGGMMMMIL PSSSDATALL
TALKNMHARL LEHASLLWQP ETTDAIRAAH EGVIGQILTM NLLRIQAFWS HYRFRQQNAR
LNALLHQQLR MTSVISSLRR MLLNWPSPPG ATREILEQLL TALASSQTDV YTVARIIASL
RPTNVADYRH VAFWQRLRYF CRLYLQSSQE LHRLQSGVDD HTRLPRTSGL ARHTDNAEAM
WSGLRTFCTL MMIGAWSIAS QWDAGANALT LAAISCVLYS AVAAPFKSLS LLMRTLVLLS
LFSFVVKFGL MVQISDLWQF LLFLFPLLAT MQLLKLQMPK FAALWGQLIV FMGSFIAVTN
PPVYDFADFL NDNLAKIVGV ALAWLAFAIL RPGSDARKSR RHIRALRRDF VDQLSRHPTL
SESEFESLTY HHVSQLSNSQ DALARRWLLR WGVVLLNCSH VVWQLRDWES RSDPLSRVRD
NCISLLRGVM SERGVQQKSL AATLEELQRI CDSLARHHQP AARELAAIVW RLYCSLSQLE
QAPPQGTLAS