Gene Elen_3060 details


Gene Information

Locus tag: Elen_3060
Symbol:
ID: 8417395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3557579
End bp: 3558961
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 62%
IMG OID: 645026040
Product: permease for cytosine/purines uracil thiamine allantoin
Protein accession: YP_003183392
Protein GI: 257792786
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCCTG AAAACGACAT CGATTACTCG CTGTCGCGCG TTCCTGATGA AGCGAAACAG 
CCATTTTGGC GGATTCTTTT CATCAGGATC GGCGCGATCT GCTGCGTATC CCAGCTCATG
TTGGGCGCGG CACTAGGCTA CGGCCTGACG TTCTGGGATG CCTTCCTGGC AACCATGCTC
GGCTCGGTGC TGCTTCAGGT GGTCAGCTGG GCGCTGGGCA CGGCGGCCGC GCGCGAAGGC
TTGTCCACCA GCCTGCTGTC CCGCTGGACC GGGTTCGGCA AGGTAGGATC CGCCCTGTTC
GGCGGCGTGG TGGCTATCTC CATGGTGGGC TGGTTCGGCG TGCAGAACGC GGTGTTTGGC
CAGGGCATGG CTGAAATCGT CCCGTTCACA GATTTCCTCG GCACGCAGGA GATCCTGCCC
GGCATCATGG CTGGAATCAC GCCCGAGTAC ATCTTCTGGG CCATCATCAC CGGCCTGGGC
ATCACGCTGC TCGTGGTGTT CGGCATCAAG GCCATCGCGA ACTTCGCCAC GGTGTTCGTG
CCGCTGTTCG TGATCGTGGT CATCGTAGCC GCAGCCATCA TCCTGCAGAA CCATTCGCTG
ACCGAGCTTC TCACCACGGC CCCTCCGGGA CCGGCGCTGT CGCTGGGCGC GGCAACCACC
ATGGTGGCGG GCGGCTTCAT TGCGGGCGCC ATCTGCACGC CCGACTACGC GCGATTCCTG
AAGAACGGCA CCCAAGCATT CTGGATGACG CTCATCGGCA CGTTCGTGGG CGAGCTGGGC
ATGAACCTGC TTGCCGTGCT GCTGGCGCAC GCCATGGGCA CCGAGAATAT CGTCGACATC
ATGATGGGCA CGTCGGGCAT CATCGGCGTC ATCATCGTAG TCGCCTCCAC GGTGAAGCTG
AACGACATCA ACCTGTACTC GTCCAGCCTG GGCTTGGCAA CCATGATCAA CGCGCTGTTC
AACAAGGCCA TCAGCCGCAA CGGACTCGTG TGGGCGCTCG GCATCGTGGG CACGCTGCTG
TCGGTCATCG GCATCATCAA CTACTTCACT AACTTCCTCA CGCTGCTGGG CGTGGCCATC
CCGCCCGTCG CCGGCATCAT GGTGGTGGAC TACTTCATCT TGAAGCGCAG CCGCGCGACG
CTTGACGCTT CGCGCGCCAA GGGCGAGCTG CCCGAGAAGG TTGAGAAGTG GAACCCCATC
GCCATCGTCT GCTGGATCGC CGGTTTCGCC GTGGGCGAGG TCACCAGCAT CATGAACGCG
GGCATTCCGG GCCTGAACTC GCTGATCCTG GCCGGCGTGC TGTACTGGAT CGTGATGAAG
GTGTACGCCT CCATGAAGAA GGTGGACACC GTTACGTTCA CGGAAACGGA CCAAGTGCTG
TAA
 
Protein sequence
MAPENDIDYS LSRVPDEAKQ PFWRILFIRI GAICCVSQLM LGAALGYGLT FWDAFLATML 
GSVLLQVVSW ALGTAAAREG LSTSLLSRWT GFGKVGSALF GGVVAISMVG WFGVQNAVFG
QGMAEIVPFT DFLGTQEILP GIMAGITPEY IFWAIITGLG ITLLVVFGIK AIANFATVFV
PLFVIVVIVA AAIILQNHSL TELLTTAPPG PALSLGAATT MVAGGFIAGA ICTPDYARFL
KNGTQAFWMT LIGTFVGELG MNLLAVLLAH AMGTENIVDI MMGTSGIIGV IIVVASTVKL
NDINLYSSSL GLATMINALF NKAISRNGLV WALGIVGTLL SVIGIINYFT NFLTLLGVAI
PPVAGIMVVD YFILKRSRAT LDASRAKGEL PEKVEKWNPI AIVCWIAGFA VGEVTSIMNA
GIPGLNSLIL AGVLYWIVMK VYASMKKVDT VTFTETDQVL
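
The length fields in this record can be cross-checked against the coordinates and the sequence blocks. The following is a minimal sketch, not part of the original record. The function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS is complete and ends in a single stop codon.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used in 10-residue block layout."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming a complete CDS with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
print(gene_length(3557579, 3558961))   # 1383
print(expected_protein_length(1383))   # 460
```

Both results agree with the Gene Length and Protein Length fields. Passing the gene sequence block through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field.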