Gene ECD_00049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00049 
SymbolyaaU 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp50003 
End bp51334 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content54% 
IMG OID 
Productpredicted transporter 
Protein accessionACT41950 
Protein GI253976280 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCGT CCAGAAACTT TGACGATCTC AAATTCTCCT CTATTCACCG CCGCATTTTG 
CTGTGGGGAA GCGGTGGTCC GTTTCTGGAT GGTTATGTAC TGGTAATGAT TGGCGTGGCG
CTGGAGCAAC TGACGCCGGC GCTAAAACTG GACGCTGACT GGATTGGCTT GTTGGGCGCG
GGAACGCTCG CCGGGCTGTT CGTTGGCACA TCGCTGTTTG GCTATATTTC CGATAAAGTC
GGACGGCGCA AAATGTTCCT CATTGATATC ATCGCCATCG GCGTGATATC GGTGGCGACG
ATGTTTGTTT CATCCCCCGT CGAACTGTTG GTGATGCGGG TACTTATCGG CATTGTCATC
GGTGCAGATT ATCCCATCGC CACCTCAATG ATCACCGAGT TCTCCAGTAC CCGTCAGCGG
GCGTTTTCCA TCAGCTTTAT CGCCGCCATG TGGTATGTCG GCGCGACCTG CGCCGATCTG
GTCGGCTACT GGCTTTATGA TGTGGAAGGC GGCTGGCGCT GGATGCTGGG TAGCGCGGCG
ATCCCCTGTT TGTTGATTTT GATTGGTCGA TTCGAACTGC CTGAATCTCC CCGCTGGTTA
TTACGCAAAG GGCGAGTAAA AGAGTGCGAA GAGATGATGA TAAAACTGTT TGGCGAACCG
GTGGCTTTCG ATGAAGAGCA GCCGCAGCAA ACCCGTTTTC GCGATCTGTT TAATCGCCGC
CATTTTCCTT TTGTTCTGTT TGTTGCCGCC ATCTGGACCT GCCAGGTGAT CCCAATGTTC
GCCATTTACA CCTTTGGCCC GCAAATCGTT GGTTTGTTGG GATTGGGTGT TGGCAAAAAC
GCGGCGCTGG GGAACGTGGT GATTAGCCTG TTCTTTATGC TCGGCTGTAT TCCGCCGATG
CTGTGGTTAA ACACTGCCGG ACGGCGTCCA TTGTTGATTG GCAGCTTTGC CATGATGACG
CTGGCGCTGG CGGTTTTGGG GCTGATCCCG GATATGGGGA TCTGGCTGGT AGTGATGGCC
TTTGCGGTGT ATGCCTTTTT CTCTGGCGGG CCGGGTAATT TGCAGTGGCT CTATCCTAAT
GAACTCTTCC CGACAGATAT CCGCGCCTCT GCCGTGGGCG TGATTATGTC CTTAAGCCGT
ATTGGCACCA TTGTTTCGAC CTGGGCACTG CCGATCTTTA TCAATAATTA CGGTATCAGT
AACACGATGC TAATGGGGGC GGGTATCTCG CTGTTTGGCT TGTTGATTTC CGTAGCGTTT
GCCCCGGAGA CTCGAGGGAT GTCACTGGCG CAAACCAGCA ATATGACGAT CCGCGGGCAG
AGAATGGGGT AA
 
Protein sequence
MQPSRNFDDL KFSSIHRRIL LWGSGGPFLD GYVLVMIGVA LEQLTPALKL DADWIGLLGA 
GTLAGLFVGT SLFGYISDKV GRRKMFLIDI IAIGVISVAT MFVSSPVELL VMRVLIGIVI
GADYPIATSM ITEFSSTRQR AFSISFIAAM WYVGATCADL VGYWLYDVEG GWRWMLGSAA
IPCLLILIGR FELPESPRWL LRKGRVKECE EMMIKLFGEP VAFDEEQPQQ TRFRDLFNRR
HFPFVLFVAA IWTCQVIPMF AIYTFGPQIV GLLGLGVGKN AALGNVVISL FFMLGCIPPM
LWLNTAGRRP LLIGSFAMMT LALAVLGLIP DMGIWLVVMA FAVYAFFSGG PGNLQWLYPN
ELFPTDIRAS AVGVIMSLSR IGTIVSTWAL PIFINNYGIS NTMLMGAGIS LFGLLISVAF
APETRGMSLA QTSNMTIRGQ RMG