Gene EcolC_3854 details

Gene Information

Locus tag: EcolC_3854
Symbol:
ID: 6064454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4211735
End bp: 4213237
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 47%
IMG OID: 641603269
Product: amino acid permease-associated region
Protein accession: YP_001726785
Protein GI: 170021831
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
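
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4211735
end_bp = 4213237

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1503
print(protein_length)  # 500
```

Both results agree with the Gene Length and Protein Length fields in the table.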


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00146223
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCTCACA CGATAAAAAA AATGAGTCTG ATAGGACTCA TATTGATGAT CTTTACTTCC 
GTATTTGGTT TTGCCAATAG CCCATCGGCT TATTACTTAA TGGGTTATAG TGCGATTCCC
TTTTATATAT TTTCTGCATT GTTATTCTTT ATTCCATTCG CCTTAATGAT GGCTGAAATG
GGAGCTGCTT ATCGCAAAGA AGAAGGTGGT ATCTATTCCT GGATGAATAA TAGTGTCGGA
CCACGTTTTG CCTTCATTGG TACGTTTATG TGGTTTTCCT CTTATATCAT CTGGATGGTG
AGTACCTCCG CGAAAGTTTG GGTACCGTTC TCAACATTCC TCTATGGTAG CGACATGACC
CAGCACTGGC GTATTGCCGG ACTGGAGCCT ACGCAGGTGG TTGGTCTGCT GGCAGTGGCA
TGGATGATTC TGGTCACCGT CGTTGCTTCT AAGGGGATTA ATAAAATTGC CCGCATTACT
GCGGTGGGCG GTATTGCAGT AATGTGTCTG AATTTAGTAT TGCTGTTAGT AAGCATTACT
ATTTTGTTAT TAAATGGTGG GCATTTCGCG CAGGATATTA ATTTCCTTGC ATCACCGAAC
CCGGGTTATC AGTCCGGTCT GGCAATGCTA TCGTTTGTGG TATTTGCCAT TTTTGCCTAT
GGCGGAATTG AAGCGGTTGG TGGTCTGGTC GATAAAACGG AAAATCCAGA AAAGAACTTT
GCCAAAGGTA TTGTTTTTGC CGCTATTGTT ATTTCAATCG GTTATTCGCT GGCAATATTT
TTATGGGGCG TCAGCACAAA CTGGCAGCAG GTATTAAGTA ATGGTTCCGT TAACCTCGGC
AATATTACCT ATGTGCTGAT GAAGAGCCTT GGGATGACGC TGGGTAATGC ACTGCATTTG
TCACCTGAAG CGTCATTGTC GCTGGGCGTA TGGTTTGCGC GTATTACTGG ACTTTCGATG
TTCCTCGCCT ATACCGGTGC GTTCTTTACG CTTTGCTATT CACCGTTGAA AGCCATCATC
CAGGGGACGC CGAAAGCATT GTGGCCGGAA CCGATGACGC GCCTGAATGC GATGGGGATG
CCGTCTATCG CCATGTGGAT GCAGTGCGGG TTGGTTACTA TCTTCATTCT GCTGGTTTCG
TTTGGTGGCG GTACCGCATC GGCGTTCTTT AACAAGCTGA CGCTGATGGC GAACGTGTCT
ATGACGCTTC CTTACCTGTT CCTCGCGCTG GCTTTCCCAT TCTTTAAAGC ACGTCAGGAT
CTCGACAGAC CATTTGTGAT TTTCAAAACG CGTATGTCGG CAATGATCGC GACGGTGGTT
GTCGTACTGG TGGTGACATT TGCGAACGTC TTCACCATCA TTCAACCTGT GGTTGAAGCT
GGAGACTGGG ACAGCACATT GTGGATGATT GGCGGCCCTG TCTTCTTCTC GCTGTTAGCG
ATGGCGATTT ACCAGAACTA TTGCAGCAGA GTGGCAAAAA ATCCGCAGTG GGCGGTGGAA
TAA
 
Protein sequence
MPHTIKKMSL IGLILMIFTS VFGFANSPSA YYLMGYSAIP FYIFSALLFF IPFALMMAEM 
GAAYRKEEGG IYSWMNNSVG PRFAFIGTFM WFSSYIIWMV STSAKVWVPF STFLYGSDMT
QHWRIAGLEP TQVVGLLAVA WMILVTVVAS KGINKIARIT AVGGIAVMCL NLVLLLVSIT
ILLLNGGHFA QDINFLASPN PGYQSGLAML SFVVFAIFAY GGIEAVGGLV DKTENPEKNF
AKGIVFAAIV ISIGYSLAIF LWGVSTNWQQ VLSNGSVNLG NITYVLMKSL GMTLGNALHL
SPEASLSLGV WFARITGLSM FLAYTGAFFT LCYSPLKAII QGTPKALWPE PMTRLNAMGM
PSIAMWMQCG LVTIFILLVS FGGGTASAFF NKLTLMANVS MTLPYLFLAL AFPFFKARQD
LDRPFVIFKT RMSAMIATVV VVLVVTFANV FTIIQPVVEA GDWDSTLWMI GGPVFFSLLA
MAIYQNYCSR VAKNPQWAVE
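
The sequences above are laid out in blocks of 10 characters, so the spaces and line breaks need to be removed before any analysis. The following is a rough sketch of how the gene sequence could be cleaned, translated, and summarized. The function names are hypothetical and not part of any database API. The codon table is the standard one, which translation table 11 shares for amino acid assignments. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for block layout."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as laid out on the page.
first_bases = clean_sequence("ATGCCTCACA CGATAAAAAA AATGAGTCTG")
print(translate(first_bases))  # MPHTIKKMSL
```

The output matches the first 10 residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content field.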