Gene ECD_00007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00007 
SymbolyaaJ 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp6527 
End bp7957 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID 
Productpredicted transporter 
Protein accessionACT41909 
Protein GI253976239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGATT TTTTCTCCTT TATTAACAGC GTCCTTTGGG GATCGGTAAT GATTTACCTG 
CTCTTCGGCG CAGGTTGTTG GTTCACTTTT CGCACCGGAT TTGTGCAGTT TCGCTACATC
CGCCAGTTTG GCAAAAGTCT TAAAAATAGC ATTCATCCAC AGCCAGGCGG TTTAACCTCA
TTTCAGTCAT TGTGTACCAG TCTTGCGGCG CGCGTGGGTA GCGGCAACCT GGCCGGCGTT
GCGCTGGCAA TTACCGCCGG TGGACCTGGA GCCGTCTTCT GGATGTGGGT TGCCGCGTTT
ATCGGCATGG CGACCTCGTT TGCCGAATGT TCCCTCGCAC AGCTTTATAA AGAACGTGAT
GCCAATGGGC AGTTTCGTGG CGGACCGGCA TGGTATATGG CGCGCGGGCT GGGGATGCGC
TGGATGGGCG TTCTGTTCGC CGTCTTTTTG CTCATCGCCT ACGGCATAAT TTTCAGCGGA
ATTCAGGCGA ACGCCGTTGC GCGAGCCCTG AGTTTTTCTT TTGATTTTCC CCCGCTGGTG
ACAGGTATTA TTCTCGCTGT CTTTGCTCTG CTGGCGATCA CTCGCGGTCT TCATGGCGTC
GCCCGGCTCA TGCAGGGGTT TGTCCCGTTG ATGGCGATAA TCTGGGTACT GACCAGCCTG
GTGATTTGCG TAATGAATAT CGGGCAACTT CCCCACGTCA TTTGGTCTAT TTTTGAGAGT
GCTTTTGGCT GGCAGGAAGC GGCAGGCGGC GCGGCGGGAT ATACCTTAAG CCAGGCGATT
ACTAACGGTT TTCAGCGCAG TATGTTTTCC AATGAGGCGG GAATGGGGTC GACGCCAAAC
GCGGCAGCGG CAGCGGCGTC CTGGCCTCCG CATCCGGCAG CGCAAGGGAT TGTCCAGATG
ATTGGCATTT TTATCGACAC CCTGGTCATC TGTACGGCAA GCGCCATGCT GATATTACTG
GCGGGTAACG GCACAACCTA CATGCCGCTG GAAGGTATTC AGCTTATCCA GAAGGCGATG
CGGGTGTTAA TGGGTTCCTG GGGTGCTGAG TTTGTTACCC TCGTGGTTAT TCTGTTTGCC
TTCAGCTCCA TCGTTGCCAA CTACATTTAC GCCGAAAACA ATCTCTTCTT TTTACGCCTG
AACAACCCTA AAGCGATCTG GTGTTTGCGG ATCTGCACCT TCGCAACGGT CATCGGCGGC
ACCTTGCTAA GTCTTCCGCT GATGTGGCAA CTGGCAGATA TCATAATGGC CTGCATGGCT
ATTACCAATT TGACCGCCAT TTTACTGCTC TCGCCTGTGG TTCATACCAT TGCCAGTGAT
TATCTACGCC AGCGTAAACT CGGCGTGCGC CCGGTGTTTG ATCCGTTGCG TTATCCGGAT
ATCGGCCGCC AGCTTTCTCC GGACGCGTGG GATGATGTTT CGCAGGAGTA A
 
Protein sequence
MPDFFSFINS VLWGSVMIYL LFGAGCWFTF RTGFVQFRYI RQFGKSLKNS IHPQPGGLTS 
FQSLCTSLAA RVGSGNLAGV ALAITAGGPG AVFWMWVAAF IGMATSFAEC SLAQLYKERD
ANGQFRGGPA WYMARGLGMR WMGVLFAVFL LIAYGIIFSG IQANAVARAL SFSFDFPPLV
TGIILAVFAL LAITRGLHGV ARLMQGFVPL MAIIWVLTSL VICVMNIGQL PHVIWSIFES
AFGWQEAAGG AAGYTLSQAI TNGFQRSMFS NEAGMGSTPN AAAAAASWPP HPAAQGIVQM
IGIFIDTLVI CTASAMLILL AGNGTTYMPL EGIQLIQKAM RVLMGSWGAE FVTLVVILFA
FSSIVANYIY AENNLFFLRL NNPKAIWCLR ICTFATVIGG TLLSLPLMWQ LADIIMACMA
ITNLTAILLL SPVVHTIASD YLRQRKLGVR PVFDPLRYPD IGRQLSPDAW DDVSQE