Gene Elen_0693 details


Gene Information

Locus tag: Elen_0693
Symbol:
ID: 8414983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 873945
End bp: 875084
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 73%
IMG OID: 645023666
Product: ApbE family lipoprotein
Protein accession: YP_003181063
Protein GI: 257790457
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
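
The coordinate and length fields in this record are tied together by simple arithmetic. The Python sketch below checks that relationship. It assumes the start and end positions are inclusive 1-based coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(873945, 875084)
print(length, protein_length(length))  # prints: 1140 379
```

Both values agree with the Gene Length and Protein Length fields listed above.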


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAAGC GTTTCGCGCG ACAGAACCGA ACCACCCGCT CGTCCCGTCG TTCTTGGACC 
GGACGAGCGT TTGCTGCGCT GCTCGCATGC GCGCTGGGCG CTGCGCTGGC CGTCGGCCCC
GTCGGATGCG CGGGCGGCGA GCCCGACGGG GCGGACGTCC CGACCACAAC GGCCACCAGC
TCCTTCACGG CTATGGACAC CGGCATGACC CTCACCGTGC ACGCCGCCAG CCAAGCCGTC
GCTGACGACG CGGCAAACGC CTGCCGACAG CGCGTCCTCG AGCTGGACGA GCTGCTGGCC
CCCGAGAACG CGGCCAGCGA GATAGCCCGG GCGAATGCCG CAGGCGGCGC GCCGACGGCC
GTCTCGCCGG CGACGGCCGC GCTCGTCGCC GCGTCGCTCG ACGCGGCGAG CCAGACGGGC
GGCGCCTTCG ACCCCACGGT GTACCCGCTG ACGAGCGCCT GGGGCTTCAC GACCGGCGAC
CATCGGGTAC CCTCCACCGA CGAGCTGGCC GCGCTGCTGC CGCGGGTGGG ATACGCCGCC
GTGCAGGTGG ACGAAACCGC CGGCACCGTC ACGCTGGAAG GCGGCGCGCA GATCGACGTG
GGCGGCGTGG CGAAGGGCTT CGCAGCCGAC GAGCTGCGCG CCCTGCTGCG CGAGCGCGAC
ATAACCTCGG CGCTGTTCGA CCTCGGCGGC AACGTGACGG CCCTCGGCTC CAAGCCCGAC
GGCTCGCCCT GGAAGGTGGG CGTCGCCGAC CCGGACGACC CCGGGAAGCT GGCAGGCCTG
CTCGAGGTGC GCGACGCCAC GGTGTCTACC TCGGGCGCCT ACCAGCGATA CTTCGAAGGC
AAAGACGGAA CGCGCTACCA CCACTTGCTC GACCCCTCCA CCGGCTATCC TGCCGCGTCC
GACCTGGCCT CCGCAAGCGT CGTCGGCGCG GACGGCGCGC AATGCGACGC GCTGTCCACC
GCCTGCTTCG TGCTGGGGCT CGACGGCGCC CTCGACCTGT GGCGCGCGGC CGCGGCGGAC
GGCGATGCGG CGTTCGACCT CGTGCTGATC GCCTCCGACG GTCGCACGTT CGTCACCGAC
AGCATCGCCG ATGCGTACAC TCCGGCCGCC GGAACCAACG CCCAGGTGGT GCGCTCATGA
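
A GC content figure such as the one in the Gene Information block can be recomputed from a sequence laid out in 10-base groups like the one above. The following Python sketch shows one way to do that, assuming whitespace and any non-ACGT characters should be ignored; `gc_fraction` is a hypothetical helper name, and the input shown is only the first two 10-base groups of the gene sequence.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq."""
    # Drop spaces, line breaks, and anything that is not an unambiguous base.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base groups of the gene sequence, spacing kept as displayed.
print(gc_fraction("ATGGTGAAGC GTTTCGCGCG"))
```

Passing the full gene sequence, with its spaces and line breaks left in, would give the fraction for the whole gene.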
 
Protein sequence
MVKRFARQNR TTRSSRRSWT GRAFAALLAC ALGAALAVGP VGCAGGEPDG ADVPTTTATS 
SFTAMDTGMT LTVHAASQAV ADDAANACRQ RVLELDELLA PENAASEIAR ANAAGGAPTA
VSPATAALVA ASLDAASQTG GAFDPTVYPL TSAWGFTTGD HRVPSTDELA ALLPRVGYAA
VQVDETAGTV TLEGGAQIDV GGVAKGFAAD ELRALLRERD ITSALFDLGG NVTALGSKPD
GSPWKVGVAD PDDPGKLAGL LEVRDATVST SGAYQRYFEG KDGTRYHHLL DPSTGYPAAS
DLASASVVGA DGAQCDALST ACFVLGLDGA LDLWRAAAAD GDAAFDLVLI ASDGRTFVTD
SIADAYTPAA GTNAQVVRS
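
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The Python sketch below illustrates that mapping under stated assumptions: it uses the standard codon-to-amino-acid assignments, which table 11 shares with the standard code, and it does not model table 11's alternative start codons, which is harmless here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.upper().split())  # remove spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base groups of the gene sequence.
print(translate("ATGGTGAAGC GTTTCGCGCG"))  # prints: MVKRFA
```

The output matches the first six residues of the protein sequence above (MVKRFA).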