Gene Elen_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2031 
Symbol 
ID8416342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2379308 
End bp2380303 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content61% 
IMG OID645025008 
ProductABC-type transporter, periplasmic component 
Protein accessionYP_003182384 
Protein GI257791778 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.702659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.000664756 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCCTTG TGATGACCCG GCGATCGTTT TTGGCGGTTG CGGCGGGTGC GGCTGCGTCG 
CTTGCCTTGT CGGGATGCAG CTCCGCGAGC GATAACACCG TGGTGATCTA CTCGTGCGGC
GAGGGCGAGG CGAACGAGGT GCTGCTCGAG GCCATGCATC GCGATCTGCC GCAGTACGAT
ATCCGCTTGC ACTACGTGTC GACCGGCACG TGCGCGGCGA AGCTCCAGAA CGAGGGTACG
TCGAGCGAGG CCGATATCGT TCTCATGCTC GAGGGCGGTT ACCTCAGGCA GATTCAGCCG
AGCTTGGCGA AGCTGACCTC GTACGATTTC GAAGTGTTCG AAGACGATCT GCTCGACGGC
TCGAGTACCT ACCTTCCCTT CAGGCGCGAG AGCGCATGCG TTGCCATGAA CGTCGGGGAG
CTGACCGCTC GCGGCATCGC CATACCCGAG ACGTACGACG ATCTGCTCGA TCCAACGTAC
CGTGGGCTCA TCACGATGGC GAATCCGAAA TCTTCCGGTA CGGGCTACAA CTTCGTTAAG
AGTCTCGTGA ACACGCGGGG CGAGGATGCG GCCTTCGAGT ACTTTGACAA GCTGGCCGAG
AACGTGTACC AGTTCTCGTC GTCGGGATCG GGGCCGGTCA ACGCGCTCGT GCAAGGAGAG
GCGTTGATCG GGTTCGGCCT CACCTACCAA GCGGTGTCCG AGATCAACAA GGGCGTGCCC
ATCGAGGTGC GGTTCTTCGA GGAGGGCTCG CCTTGGACGA TGAACGGCGT GGCGGTGGTC
GACGGCAAGC AGGATAAGCC GGCGGTGCGA GCCGTTATGG ATTGGATGTT CAGCACGGGC
ATCCTGCTGG ACAAGCAGGA GTTCGTCCCG GACAAGGTGT TCGTCGATCA GCATACCGAG
ATCCCGAACT ACCCGCAGGA TACGCACTAC GCGGACATGG AAGGCGTGTT CGACATCGAC
GAGAAAAAGC GGTTGCTAGG GAAGTGGAAG TACTGA
 
Protein sequence
MPLVMTRRSF LAVAAGAAAS LALSGCSSAS DNTVVIYSCG EGEANEVLLE AMHRDLPQYD 
IRLHYVSTGT CAAKLQNEGT SSEADIVLML EGGYLRQIQP SLAKLTSYDF EVFEDDLLDG
SSTYLPFRRE SACVAMNVGE LTARGIAIPE TYDDLLDPTY RGLITMANPK SSGTGYNFVK
SLVNTRGEDA AFEYFDKLAE NVYQFSSSGS GPVNALVQGE ALIGFGLTYQ AVSEINKGVP
IEVRFFEEGS PWTMNGVAVV DGKQDKPAVR AVMDWMFSTG ILLDKQEFVP DKVFVDQHTE
IPNYPQDTHY ADMEGVFDID EKKRLLGKWK Y