Gene Elen_3046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_3046 
Symbol 
ID8417381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3542404 
End bp3543711 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content61% 
IMG OID645026026 
Producthypothetical protein 
Protein accessionYP_003183378 
Protein GI257792772 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCG TTATTGTGAT CCCCACTTTC GTGTCCGCGC GTCGCCGCAA AGAAGGCGGC 
AGCGTGCTCA CGACCTATGA TCACGCGACT CCCATCTCGC AACCCGGCGA GCTCCCGCGT
TTGCTTGCGT CGTTGCAGAA GGTGCGCGGC GCCGGCCAGA TCATCGTGCT CGTGGTCAGC
GAGCCGTCGA TCGAGAACCA GGCGGCCGAG AAGGTTCAAA GCGTTGTCTC GCGCTTTCCC
TCGCTGAACA CCGTGGTCAT CGGCGCTCCC GAGCTGGCAC TTATCCAGCA GCGCATGGAG
CAGCTGGGTT TGGGCAAGCT GCAGAAGGAG ATCGGCCTGT CCGGCTACGG TGCGGTGCGC
AACCTGGGTC TGGTGATGGC CGACGTGCTG GGCTTCGACT CGGTGGTGTT CCTCGACGAC
GACGAAGTGG TGGATGACGC CGACTTCCTG CAGAAGGCCA TGTACGGCCT GGGCAAGCTC
ACGAAGAAGG GCATTCCCAT CCTGGCCAAG ACCGGCTTCT ACTTCAATTC CGAAGGCTCC
TACCTGTCGA AGAGCCAGGA CAAGTGGTAC AACCATTTCT GGCAGCAGGG AAAGGCCTTC
AACAAATGGA TCTCGAAGGC CATGCGCGGC CCTCGTCTTT CCCGATCGAA CCATACGTGC
GGCGGCTGCC TTGCTTTGCA TAAAGAGGCG TTCAAGCGTC TGTCGTTCGA TCCTTGGATC
GCGCGCGGCG AAGATCTCGA TTACATGCTT GACCTGCGTA TGTACGGTTC GGACATCTGG
TTCGACAATC AGTGGAGCCT GCGCCACCTT CCTCCCGAAA CCGAGAGCGA GGGCACGCGC
TTCCGTCAGG ATATCTTCCG ATGGCTCTAC GAATACCGGA AGATGGAGTA CAGCCGCACG
CAGATCGACC TTTTGCAGGT GAAGCCGTCT TCGCTGGAGC CGTATCCGGG CCCGTTCCTT
GAGCCAGGCA TCACGAAGCG CATTCGTTTG ACCGCCTTTC TGAGGAGCTT GGCGCGCCCC
GACAAGAAGG CGTACCGGAA AGCGGCGAAG GCGGCCACCG GCGAAGCGAC GACGTATGCC
CAGCGCAACT GCTCGAAGTA CTTCGAGTTC CAGTTCGTGT GGCCGGAGCT GATGGCGCGC
ATGGAGAACG ATCAGATCCT GCGTACGGCG CTTATGCAGT CGGCCGCGCA GCGCCAGGCC
AGCGCCGGCA ACGGAGCCGA TCGACTTGCT TCGGCGCAGG CGGCCATCGC GGCGGCCGGC
ATCGATCCGG GTGTGACGAG CGAGATTCGC CTGAACGTCG CGGAATAA
 
Protein sequence
MNPVIVIPTF VSARRRKEGG SVLTTYDHAT PISQPGELPR LLASLQKVRG AGQIIVLVVS 
EPSIENQAAE KVQSVVSRFP SLNTVVIGAP ELALIQQRME QLGLGKLQKE IGLSGYGAVR
NLGLVMADVL GFDSVVFLDD DEVVDDADFL QKAMYGLGKL TKKGIPILAK TGFYFNSEGS
YLSKSQDKWY NHFWQQGKAF NKWISKAMRG PRLSRSNHTC GGCLALHKEA FKRLSFDPWI
ARGEDLDYML DLRMYGSDIW FDNQWSLRHL PPETESEGTR FRQDIFRWLY EYRKMEYSRT
QIDLLQVKPS SLEPYPGPFL EPGITKRIRL TAFLRSLARP DKKAYRKAAK AATGEATTYA
QRNCSKYFEF QFVWPELMAR MENDQILRTA LMQSAAQRQA SAGNGADRLA SAQAAIAAAG
IDPGVTSEIR LNVAE