Gene Elen_1147 details


Gene Information

Locus tag: Elen_1147
Symbol:
ID: 8415437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1378080
End bp: 1379660
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 72%
IMG OID: 645024109
Product: PSP1 domain protein
Protein accession: YP_003181506
Protein GI: 257790900
COG category: [S] Function unknown
COG ID: [COG1774] Uncharacterized homolog of PSP1
TIGRFAM ID:
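
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1378080
END_BP = 1379660
GENE_LENGTH_BP = 1581
PROTEIN_LENGTH_AA = 526


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the terminal stop codon


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```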


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.233245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCGCA TAGCGCCGAT TAACCTGTAT TACAACCCCA AGACGCTGTG GTTCGACGCC 
GGCGACCTGG ACGTGCGCGC CGGGGACGGC GTGATCGTGT CCACGGCCCG CGGCACCGAG
TTCGGCCGCG CGGCCCACGA CGTGTTCGAG GCCGACGAGG CGCAGATCAA GAAGCTGAAA
AGCCCGCTCA AGCCTGTCAA ACGCATCGCG ACGGACGAGG ACGAGGCGCG CGCGGCCGAG
CTGGAGGCGA AGAGCCGCGA GGCGCTGCCC GTGTTCAAGG AGATGGCTGC CGAGGGCAAC
GGCGACATGC ACCCCGTGTC GGTGGAGTAC CTGTTCGAGG GCGACAAGGC CATCTTCTAC
TTCGAGGCGG AGGAGCGCGT GGACTTCCGC GAGCTCGTGC GCAAGCTGGC CGCGCACTTC
CGCGTGCGCA TCGACATGCG ACAGATCGGC GTGCGCGATG AGGCCCGCAT GGTGGGCGGC
CTGGGGCATT GCGGCCAGGA GCTGTGCTGC AAGCGCCTGG GCGGCGAGTT CTGCCCCGTG
TCCATCCGCA TGGCGAAGGA GCAGGACCTC TCGCTGAACC CGCAGAAGAT ATCGGGCGTG
TGCGGACGGC TCATGTGCTG TCTGCGCTAC GAGTTCGACG CGTACAAGGA CTTCAAGAGC
CGCGCCCCGA AACAGAACGC CACGGTGGAG ACGCCCGATG GGCCGGCGAA GGTGGTGGAT
CTCGACGTGC CGCGCGAGAT CGTGTCGCTG AAGATCATGG GCGAGAAGCC CGTGAAGGTG
CCGCTGGCCG ACTTCGACCC GCCCGAGGAA GGCTCGAACC GCCCGAACCG CGTAGGCGAG
GAGGCGTGGC AGGACGCGAC GACGGCCGAC CCTATCGGAT TTGCGGGCGA GTCGGCGCTG
TTCGGCACCA CGACGCAGCT GACCGGGCAG GACAAGCTGG CCGATCCGGG TTCCGTGCGC
CGCACGGGCC GCGGCGGTCA GAAGCCGTCG AAGGGCGGCG GCTCGAACGG CGGCCGCGCG
GGCGGCGGCC AGAAGGGCGG CGGCAACGGC GGCCAGAAGG GCGGCAAACA GGCCGACGCG
CAGGCGCAGA GCGCGCGCAA GCCGCGCAGG AGGCGCTCGA CGAAGGTCGG CGGCGAGGGC
GCTGCCGCCC CCGAGGCGGC CGAGACGCAG AAGCGCAAGC AGAAGCAGCA AGGCGGCGGC
TCGCCGAAGG GCGGGCAGGG CGGCCAGCAG CAGAAGCGCC GGTCGGGTCA GGGCGGTCAG
AGCGGCAACG GCGGCTCCAA GAAGCAGCAG GGCCAGCGCC AGGGAGGCGA GGGCGCGAAG
AAGCAGGGGC CGAAGGGCAT GCAGCCCTCG AAGCCTCGTC CCGGCCAGAA GTCCTCAGGC
CTGCGCCAGG GCCAGAAGCC GCAGCAGCCG CGCCAGGACA AGGCGCCCCG CCCCGAGCGC
TCGGGGGCTC CGAGCGGCGA GGGCGGCCGC CCGACGGGAG ACGGCGGGCA TCGCCGCGCC
CGTCGCCGCA GCCACAAGGC GGGCGGCTCG GACGGCGCGG GCGCGCCCGG AGCGGGCGGC
GCGGCGCCGA GCGGCGAATA G
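
A GC content figure such as the one listed under Gene Information is presumably the percentage of G and C bases in the gene sequence. The following is a minimal sketch under that assumption; `gc_percent` is a hypothetical helper, and it ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first three blocks of the gene sequence above;
# pasting the full sequence works the same way.
first_blocks = "ATGGTTCGCA TAGCGCCGAT TAACCTGTAT"
print(f"{gc_percent(first_blocks):.1f}% GC over {len(first_blocks.replace(' ', ''))} bases")
```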
 
Protein sequence
MVRIAPINLY YNPKTLWFDA GDLDVRAGDG VIVSTARGTE FGRAAHDVFE ADEAQIKKLK 
SPLKPVKRIA TDEDEARAAE LEAKSREALP VFKEMAAEGN GDMHPVSVEY LFEGDKAIFY
FEAEERVDFR ELVRKLAAHF RVRIDMRQIG VRDEARMVGG LGHCGQELCC KRLGGEFCPV
SIRMAKEQDL SLNPQKISGV CGRLMCCLRY EFDAYKDFKS RAPKQNATVE TPDGPAKVVD
LDVPREIVSL KIMGEKPVKV PLADFDPPEE GSNRPNRVGE EAWQDATTAD PIGFAGESAL
FGTTTQLTGQ DKLADPGSVR RTGRGGQKPS KGGGSNGGRA GGGQKGGGNG GQKGGKQADA
QAQSARKPRR RRSTKVGGEG AAAPEAAETQ KRKQKQQGGG SPKGGQGGQQ QKRRSGQGGQ
SGNGGSKKQQ GQRQGGEGAK KQGPKGMQPS KPRPGQKSSG LRQGQKPQQP RQDKAPRPER
SGAPSGEGGR PTGDGGHRRA RRRSHKAGGS DGAGAPGAGG AAPSGE
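
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this under a few assumptions: the gene sequence is read 5' to 3' in frame from its first base, and translation table 11 assigns the same amino acid to each codon as the standard code (it differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first position varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, ignoring whitespace and stopping at a stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence above.
print(translate("ATGGTTCGCA TAGCGCCGAT TAACCTGTAT"))  # MVRIAPINLY
print(translate("GCGGCGCCGA GCGGCGAATA G"))           # AAPSGE
```

The two printed fragments match the start and end of the protein sequence listed above.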