Gene Elen_2428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2428 
Symbol 
ID8416752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2847006 
End bp2848214 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content54% 
IMG OID645025412 
Productprotein of unknown function DUF201 
Protein accessionYP_003182775 
Protein GI257792169 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.281403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGCGA CTGATCGCAA AAAGGTTCTT ATCCTTGGAG GAAGCAGACT TCAAGCCCCT 
GCAATTGAAG TTGCGAAGAG GCTTGGCTTC TGGGTTATTT GCGCCGATTA CGATCCCGAT
GCTGTTGGGT TCGCTCTCGC CGATCAATCA GAGCTCATCA GCACGCTCGA TGCTGAGGCG
GTTCTTGCGC TTGCGAAAAG GGAGCGCGTC GCTTTCGTTA TCACTTCGAC GAGCGATGCC
CCTGTAAGGA CAGCTTCTTT TGTGTCTGAG CAGCTCGATC TGCCCGTCGG CATATCCTAT
GCCGACTCAA TATGCGCGAC CCATAAAGAC GCCATGCGTC GTCGTTTGGC GAAATACAAT
GTCCCTATTC CAGAATTTAG GGTATGCAGC AATCTCGACG AGTTCGTTGA AGCTCTGAAT
TATTTCGAAT ATCGGTGCAT CATCAAGCCA GCGGACAGCG CGGCGAGCAG GGGGGTGAAA
CTGATAGACT CCTCTATTCG TCATGACGAC CCGGAGGAGC TGTTCGAAAG GGGGATGTCT
TTTTCGCGCA AAAGAACCTT AATGGTTGAG CGGTGCATTT CCGGAACCGA GGTTAGCGTC
GAGGGCATGA CCGTCAACGG AAAAACCCAC ATCCTTGCCA TCACCGACAA GATGGTAACG
GAGCCGCCGT ACTTCGTTGA ACTCGGACAC TCTGAGCCAG CTCTCCTGGA AGATGCCGAA
AAGGTTCGCA TAGAAGATGT GGCACGAGCC GCAATCGAAG CCGTGGGAAT CGTAAACGGT
CCCTCTCACA CCGAGATTAT GATCACGGAT AGCGGGCCGA TGGTGATCGA AATCGCCGCA
CGTCTGGGAG GCGACTACAT CACTTCTCGG CTCGTTCCCC TTTCGACAGG CTTCGACATG
GTTGGCGCCT CCGTGGAGCT CGCATTGGGG ATGCCGGTAG ACTTCCCTCC GCCCAAACAG
GATGGGAGTG CTGTAAGATT TATCGTTTCG GACACTGGCG TGATTTCCGA TATCAAGATC
GATTCTGCAA TATACGATCT TCCCGGTTTC GAAGAGCTTG AATTGTACAA GAAGTCGGGG
GATGCGATCT CCGAACCGCA TTCGAGTAAC GACCGCGTCG GTCATGTAAT ATGCACCGGC
CCCGATGCTC TATCGGCAAG AGAGACGGCC GAGAAGGCTC TATCCATGAT CCACGTCTCG
CTTTCCTGA
 
Protein sequence
MGATDRKKVL ILGGSRLQAP AIEVAKRLGF WVICADYDPD AVGFALADQS ELISTLDAEA 
VLALAKRERV AFVITSTSDA PVRTASFVSE QLDLPVGISY ADSICATHKD AMRRRLAKYN
VPIPEFRVCS NLDEFVEALN YFEYRCIIKP ADSAASRGVK LIDSSIRHDD PEELFERGMS
FSRKRTLMVE RCISGTEVSV EGMTVNGKTH ILAITDKMVT EPPYFVELGH SEPALLEDAE
KVRIEDVARA AIEAVGIVNG PSHTEIMITD SGPMVIEIAA RLGGDYITSR LVPLSTGFDM
VGASVELALG MPVDFPPPKQ DGSAVRFIVS DTGVISDIKI DSAIYDLPGF EELELYKKSG
DAISEPHSSN DRVGHVICTG PDALSARETA EKALSMIHVS LS