Gene ECD_02947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02947 
SymbolygjI 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3092316 
End bp3093749 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content50% 
IMG OID 
Productpredicted transporter 
Protein accessionACT44751 
Protein GI253979081 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATA CCAAACGTAA TACAATCGGC AAATTCGGCT TGCTCTCGCT GACTTTTGCC 
GCCGTTTACA GCTTTAACAA CGTTATCAAT AATAATATTG AGCTTGGACT GGCCTCGGCA
CCGATGTTTT TCCTCGCGAC GATTTTTTAT TTTATTCCCT TCTGTCTGAT CATCGCAGAA
TTTGTTTCGT TAAATAAAAA CTCAGAAGCC GGTGTCTACG CGTGGGTAAA AAGTTCGCTG
GGCGGACGTT GGGCATTTAT TACTGCCTAT ACCTACTGGT TCGTAAACCT GTTCTTTTTC
ACCTCACTGT TGCCGCGCGT TATTGCTTAT GCTTCGTATG CCTTCCTCGG ATACGAATAT
ATTATGACGC CGGTTGCCAC CACCATTATC AGTATGGTGC TGTTCGCCTT CTCCACCTGG
GTTTCCACCA ACGGGGCGAA AATGTTGGGG CCAATTACCT CCGTCACTTC AACGCTGATG
CTGCTGTTAA CGCTCTCCTA CATTTTACTG GCAGGTACGG CGCTGGTTGG CGGCGTACAG
CCTGCTGACG CCATCACCGT TGACGCGATG ATCCCGAACT TCAACTGGGC GTTCCTCGGC
GTTACCACCT GGATCTTTAT GGCCGCAGGT GGCGCGGAGT CCGTCGCTGT GTACGTTAAC
GACGTCAAAG GCGGTTCGAA ATCGTTCGTT AAAGTGATCA TCCTCGCCGG GATTTTTATC
GGCGTACTGT ATTCCGTCTC CTCGGTGCTG ATTAACGTCT TCGTCAGCAG CAAAGAGTTG
AAATTTACCG GCGGATCGGT GCAGGTATTC CACGGCATGG CGGCGTATTT TGGTCTACCG
GAAGCGTTGA TGAATCGCTT TGTCGGTCTG GTGTCCTTTA CCGCGATGTT CGGTTCCCTG
CTGATGTGGA CCGCAACGCC GGTGAAAATT TTCTTCTCCG AAATCCCGGA AGGCATCTTT
GGTAAGAAAA CCGTCGAACT GAACGAAAAC GGCGTTCCGG CGCGCGCAGC GTGGATCCAG
TTCCTGATCG TCATCCCGCT GATGATTATC CCGATGCTCG GTTCCAATAC CGTGCAGGAT
CTGATGAATA CTATTATTAA TATGACCGCC GCAGCGTCCA TGCTTCCGCC GTTATTCATC
ATGCTGGCTT ACCTGAATTT ACGCGCCAAA TTAGATCACC TGCCACGCGA TTTCCGTATG
GGCTCCCGCC GCACCGGTAT TATCGTTGTT TCAATGCTGA TTGCGATATT TGCCGTAGGG
TTTGTCGCTT CGACATTCCC GACTGGCGCG AATATTCTGA CCATCATTTT TTATAACGTC
GGCGGTATTG TTATCTTCCT TGGCTTTGCG TGGTGGAAAT ACAGTAAATA TATAAAGGGA
TTAACGGCTG AAGAGCGCCA TATTGAAGCG ACGCCAGCCA GCAATGTTGA TTAA
 
Protein sequence
MSDTKRNTIG KFGLLSLTFA AVYSFNNVIN NNIELGLASA PMFFLATIFY FIPFCLIIAE 
FVSLNKNSEA GVYAWVKSSL GGRWAFITAY TYWFVNLFFF TSLLPRVIAY ASYAFLGYEY
IMTPVATTII SMVLFAFSTW VSTNGAKMLG PITSVTSTLM LLLTLSYILL AGTALVGGVQ
PADAITVDAM IPNFNWAFLG VTTWIFMAAG GAESVAVYVN DVKGGSKSFV KVIILAGIFI
GVLYSVSSVL INVFVSSKEL KFTGGSVQVF HGMAAYFGLP EALMNRFVGL VSFTAMFGSL
LMWTATPVKI FFSEIPEGIF GKKTVELNEN GVPARAAWIQ FLIVIPLMII PMLGSNTVQD
LMNTIINMTA AASMLPPLFI MLAYLNLRAK LDHLPRDFRM GSRRTGIIVV SMLIAIFAVG
FVASTFPTGA NILTIIFYNV GGIVIFLGFA WWKYSKYIKG LTAEERHIEA TPASNVD