Gene ECD_02720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02720 
SymbolygfU 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2862228 
End bp2863676 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content49% 
IMG OID 
Productpredicted transporter 
Protein accessionACT44537 
Protein GI253978867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.553016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCCA TAGATTCCCA ACTTCCCTCA TCTTCTGGGC AAGACCGCCC AACTGATGAG 
GTTGACCGCA TATTATCACC AGGAAAGCTG ATCATACTCG GTCTGCAACA CGTCCTTGTC
ATGTACGCAG GTGCAGTCGC TGTTCCTCTT ATGATTGGTG ACCGACTGGG CCTCTCAAAA
GAAGCTATTG CGATGCTCAT TAGCTCGGAT CTCTTTTGCT GCGGGATCGT CACATTATTG
CAATGTATCG GTATCGGCCG CTTTATGGGG ATCCGCCTGC CGGTGATTAT GTCGGTGACC
TTTGCTGCTG TAACACCAAT GATAGCCATT GGGATGAACC CGGATATCGG CCTGCTGGGG
ATCTTTGGTG CCACTATCGC CGCGGGTTTT ATCACCACAT TATTAGCGCC ACTTATCGGT
CGCTTGATGC CTTTATTCCC GCCACTGGTT ACCGGTGTGG TTATTACTTC TATTGGGCTT
AGCATCATTC AGGTGGGTAT TGACTGGGCC GCCGGAGGTA AAGGGAATCC GCAATATGGT
AATCCCGTTT ATTTAGGTAT CTCCTTTGCC GTCTTAATTT TTATCTTGCT CATTACTCGC
TATGCGAAAG GATTTATGTC CAACGTCGCC GTATTACTGG GGATTGTATT TGGCTTTTTA
CTTTCGTGGA TGATGAATGA AGTCAACTTA TCCGGGCTAC ATGATGCCTC GTGGTTTGCG
ATTGTCACCC CGATGTCGTT TGGTATGCCG ATTTTCGATC CCGTTTCCAT TCTGACCATG
ACTGCCGTGT TAATCATCGT GTTTATCGAG TCGATGGGGA TGTTCCTGGC ACTGGGTGAA
ATAGTCGGTC GTAAACTCTC TTCACACGAT ATTATTCGCG GGCTGCGTGT CGATGGCGTA
GGGACAATGA TAGGCGGAAC GTTTAACAGC TTCCCCCACA CGTCATTTTC ACAAAACGTT
GGCCTGGTTA GCGTGACGCG TGTTCATAGC CGCTGGGTGT GTATTTCTTC GGGAATTATA
TTAATCCTGT TTGGCATGGT GCCAAAAATG GCGGTGCTGG TCGCCTCCAT TCCGCAATTT
GTGCTGGGCG GCGCTGGGCT GGTGATGTTC GGCATGGTAC TGGCGACAGG GATTCGAATT
CTGTCGCGCT GTAACTACAC CACCAACCGT TACAACCTCT ATATTGTGGC GATCAGTCTC
GGCGTTGGCA TGACTCCGAC GCTCTCTCAC GATTTCTTTT CTAAGTTACC GGCCGTACTG
CAACCGCTGC TACATAGCGG CATTATGCTC GCAACCCTTA GCGCCGTTGT GCTGAACGTC
TTCTTTAATG GCTATCAGCA TCATGCTGAC CTGGTGAAGG AATCCGTCTC TGATAAAGAT
TTAAAAGTCA GGACAGTACG TATGTGGCTT CTGATGCGCA AGCTGAAGAA AAATGAGCAT
GGAGAATAA
 
Protein sequence
MSAIDSQLPS SSGQDRPTDE VDRILSPGKL IILGLQHVLV MYAGAVAVPL MIGDRLGLSK 
EAIAMLISSD LFCCGIVTLL QCIGIGRFMG IRLPVIMSVT FAAVTPMIAI GMNPDIGLLG
IFGATIAAGF ITTLLAPLIG RLMPLFPPLV TGVVITSIGL SIIQVGIDWA AGGKGNPQYG
NPVYLGISFA VLIFILLITR YAKGFMSNVA VLLGIVFGFL LSWMMNEVNL SGLHDASWFA
IVTPMSFGMP IFDPVSILTM TAVLIIVFIE SMGMFLALGE IVGRKLSSHD IIRGLRVDGV
GTMIGGTFNS FPHTSFSQNV GLVSVTRVHS RWVCISSGII LILFGMVPKM AVLVASIPQF
VLGGAGLVMF GMVLATGIRI LSRCNYTTNR YNLYIVAISL GVGMTPTLSH DFFSKLPAVL
QPLLHSGIML ATLSAVVLNV FFNGYQHHAD LVKESVSDKD LKVRTVRMWL LMRKLKKNEH
GE