Gene ECD_00461 details


Gene Information

Locus tag: ECD_00461
Symbol: allP
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 506778
End bp: 508232
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 47%
IMG OID:
Product: predicted allantoin transporter
Protein accession: ACT42360
Protein GI: 253976690
COG category:
COG ID:
TIGRFAM ID:
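The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 506778
end_bp = 508232

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # length in bp
print(protein_length)  # length in aa
```

Under these assumptions the computed values agree with the listed Gene Length (1455 bp) and Protein Length (484 aa).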


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.592868
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACATC AGAGAAAACT ATTCCAGCAA CGCGGCTATA GCGAAGATCT ATTGCCGAAA 
ACGCAAAGCC AGCGGACCTG GAAAACATTT AACTATTTTA CCTTATGGAT GGGTTCGGTT
CATAACGTTC CCAATTATGT GATGGTCGGC GGCTTTTTTA TTCTCGGCTT GTCTACCTTT
AGTATTATGC TGGCAATTAT CCTCAGCGCC TTTTTCATTG CCGCGGTAAT GGTATTAAAC
GGTGCTGCGG GCAGTAAATA CGGTGTGCCT TTTGCCATGA TCCTGCGTGC TTCTTACGGT
GTACGTGGTG CACTGTTTCC CGGATTATTA AGGGGCGGAA TTGCCGCCAT CATGTGGTTT
GGTTTGCAAT GTTACGCGGG GTCACTGGCC TGCTTGATTC TGATTGGCAA AATCTGGCCG
GGATTTTTAA CTCTCGGTGG TGATTTCACT CTGTTAGGCC TTTCTCTACC GGGCTTAATT
ACTTTCTTAA TCTTCTGGCT GGTCAACGTT GGTATAGGTT TTGGCGGTGG CAAAGTTTTA
AATAAATTCA CTGCCATTCT TAACCCGTGC ATCTATATCG TTTTCGGCGG TATGGCGATT
TGGGCGATTT CACTGGTCGG GATCGGTCCA ATCTTTGACT ACATTCCGAG CGGTATTCAG
AAAGCAGAAA ACGGTGGCTT CCTGTTCCTG GTGGTGATTA ACGCGGTAGT TGCGGTCTGG
GCGGCACCGG CGGTGAGCGC ATCCGACTTT ACGCAAAACG CCCACTCGTT TCGTGAGCAG
GCGCTGGGGC AAACGCTGGG TTTAGTTGTG GCCTATATTC TGTTTGCGGT CGCCGGGGTA
TGTATTATTG CCGGAGCCAG TATTCACTAC GGCGCTGATA CCTGGAACGT GCTGGATATT
GTTCAGCGTT GGGACAGCCT GTTCGCCTCG TTCTTTGCGG TACTGGTTAT TCTGATGACA
ACTATCTCCA CTAACGCGAC CGGTAATATT ATTCCAGCCG GTTATCAGAT TGCCGCCATT
GCACCGACAA AACTGACCTA TAAAAACGGC GTACTGATTG CCAGTATTAT CAGCTTGCTG
ATCTGCCCGT GGAAATTAAT GGAAAATCAG GACAGCATTT ATCTTTTCCT CGATATTATC
GGCGGAATGC TTGGTCCGGT AATTGGTGTC ATGATGGCGC ATTATTTTGT GGTGATGCGC
GGACAAATTA ATCTTGATGA ACTGTATACC GCACCTGGCG ATTATAAATA TTACGATAAC
GGTTTTAACC TCACTGCGTT TTCAGTAACT CTGGTGGCCG TTATTTTATC TCTTGGCGGT
AAGTTTATTC ACTTTATGGA ACCGTTATCG CGTGTTTCAT GGTTTGTCGG CGTCATCGTC
GCCTTTGCGG CCTACGCCTT ATTAAAGAAA CGTACAACAG CAGAAAAAAC AGGAGAGCAA
AAAACCATAG GTTAA
 
Protein sequence
MEHQRKLFQQ RGYSEDLLPK TQSQRTWKTF NYFTLWMGSV HNVPNYVMVG GFFILGLSTF 
SIMLAIILSA FFIAAVMVLN GAAGSKYGVP FAMILRASYG VRGALFPGLL RGGIAAIMWF
GLQCYAGSLA CLILIGKIWP GFLTLGGDFT LLGLSLPGLI TFLIFWLVNV GIGFGGGKVL
NKFTAILNPC IYIVFGGMAI WAISLVGIGP IFDYIPSGIQ KAENGGFLFL VVINAVVAVW
AAPAVSASDF TQNAHSFREQ ALGQTLGLVV AYILFAVAGV CIIAGASIHY GADTWNVLDI
VQRWDSLFAS FFAVLVILMT TISTNATGNI IPAGYQIAAI APTKLTYKNG VLIASIISLL
ICPWKLMENQ DSIYLFLDII GGMLGPVIGV MMAHYFVVMR GQINLDELYT APGDYKYYDN
GFNLTAFSVT LVAVILSLGG KFIHFMEPLS RVSWFVGVIV AFAAYALLKK RTTAEKTGEQ
KTIG
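
The gene and protein sequences above are printed in groups of ten residues, so they need cleaning before any analysis. The sketch below strips the grouping and translates DNA codon by codon. It is an illustration, not the database's own pipeline: the function names are hypothetical, and the codon table is built from the amino-acid assignments that NCBI translation table 11 shares with the standard code. Only the first and last lines of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses
# the same codon-to-residue assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGGAACATC AGAGAAAACT ATTCCAGCAA CGCGGCTATA GCGAAGATCT ATTGCCGAAA"
last_line = "AAAACCATAG GTTAA"

print(translate(clean_sequence(first_line)))
print(translate(clean_sequence(last_line)))
```

The two printed fragments should correspond to the first twenty and the last four residues of the protein sequence listed above. The last line works on its own only because the 1440 bases before it are a multiple of three, so it begins in frame.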