Gene EcolC_3070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3070 
Symbol 
ID6066169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3353888 
End bp3355300 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content53% 
IMG OID641602486 
Productphenylalanine transporter 
Protein accessionYP_001726021 
Protein GI170021067 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCTCA ACAAAAAAGA CACACAGGGG AAAGGCGTGA AAAACGCGTC AACCGTATCG 
GAAGATACTG CGTCGAATCA AGAGCCGACG CTTCATCGCG GATTACATAA CCGTCATATT
CAACTGATTG CGTTGGGTGG CGCAATTGGT ACTGGTCTGT TTCTTGGCAT TGGCCCGGCG
ATTCAGATGG CGGGTCCGGC TGTATTGCTG GGCTACGGCG TCGCCGGGAT CATCGCTTTC
CTGATTATGC GCCAGCTTGG CGAAATGGTG GTTGAGGAGC CGGTATCCGG TTCATTTGCC
CACTTTGCCT ATAAATACTG GGGACCGTTT GCGGGCTTCC TCTCTGGCTG GAACTACTGG
GTAATGTTCG TGCTGGTGGG AATGGCAGAG CTGACCGCTG CGGGCATCTA TATGCAGTAC
TGGTTCCCGG ATGTTCCAAC GTGGATTTGG GCTGCCGCCT TCTTTATTAT CATCAACGCC
GTTAACCTGG TGAACGTGCG CTTATATGGC GAAACCGAGT TCTGGTTTGC GCTGATTAAA
GTGCTGGCGA TCATCGGTAT GATCGGCTTT GGCCTGTGGC TGCTGTTTTC TGGTCACGGC
GGCGAGAAAG CCAGTATCGA CAACCTCTGG CGCTACGGTG GTTTCTTCGC CACCGGCTGG
AATGGGCTGA TTTTGTCGCT GGCGGTAATT ATGTTCTCCT TCGGCGGTCT GGAGCTGATT
GGGATTACTG CCGCTGAAGC GCGCGATCCG GAAAAAAGCA TTCCAAAAGC GGTAAATCAG
GTGGTGTATC GCATCCTGCT GTTTTACATC GGTTCACTGG TGGTTTTACT GGCGCTCTAT
CCGTGGGTGG AAGTGAAATC CAACAGTAGC CCGTTTGTGA TGATTTTCCA TAATCTCGAC
AGCAACGTGG TAGCTTCTGC GCTGAACTTC GTCATTCTGG TAGCATCGCT GTCAGTGTAT
AACAGCGGGG TTTACTCTAA CAGCCGCATG CTGTTTGGCC TTTCTGTGCA GGGTAATGCG
CCGAAGTTTT TGACTCGCGT CAGCCGTCGC GGTGTGCCGA TTAACTCGCT GATGCTTTCC
GGAGCGATCA CTTCGCTGGT GGTGTTAATC AACTATCTGC TGCCGCAAAA AGCGTTTGGT
CTGCTGATGG CGCTGGTGGT AGCAACGCTG CTGTTGAACT GGATTATGAT CTGTCTGGCG
CATCTGCGTT TTCGTGCAGC GATGCGACGT CAGGGGCGTG AAACACAGTT TAAGGCGCTG
CTTTATCCGT TCGGCAACTA TCTTTGCATC GCCTTCCTCG GCATGATTTT GCTGCTGATG
TGCACGATGG ATGATATGCG CTTGTCAGCG ATCCTGCTGC CGGTGTGGAT TGTATTCCTG
TTTGTGGCAT TTAAAACGCT GCGTCGGAAA TAA
 
Protein sequence
MPLNKKDTQG KGVKNASTVS EDTASNQEPT LHRGLHNRHI QLIALGGAIG TGLFLGIGPA 
IQMAGPAVLL GYGVAGIIAF LIMRQLGEMV VEEPVSGSFA HFAYKYWGPF AGFLSGWNYW
VMFVLVGMAE LTAAGIYMQY WFPDVPTWIW AAAFFIIINA VNLVNVRLYG ETEFWFALIK
VLAIIGMIGF GLWLLFSGHG GEKASIDNLW RYGGFFATGW NGLILSLAVI MFSFGGLELI
GITAAEARDP EKSIPKAVNQ VVYRILLFYI GSLVVLLALY PWVEVKSNSS PFVMIFHNLD
SNVVASALNF VILVASLSVY NSGVYSNSRM LFGLSVQGNA PKFLTRVSRR GVPINSLMLS
GAITSLVVLI NYLLPQKAFG LLMALVVATL LLNWIMICLA HLRFRAAMRR QGRETQFKAL
LYPFGNYLCI AFLGMILLLM CTMDDMRLSA ILLPVWIVFL FVAFKTLRRK