Gene Elen_1342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1342 
Symbol 
ID8415640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1606887 
End bp1608743 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content58% 
IMG OID645024311 
Product5'-Nucleotidase domain protein 
Protein accessionYP_003181700 
Protein GI257791094 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.206839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.550173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCC GGGCACGCAC CTTTATAGCG GCTTTGACCG TCATGGCCAC GATGGCGGGG 
CTGACCTCAT GCGCGCACTC GCAACCGGAA CCCGGCTACC CCATCTGCAC GGAAAACGAG
ATCGTGACCT ACGAGCCGAC GGACACGACC AAAACCGTCA TCACCATAGG TCGCTACACC
ATATTCAATT CGGAGCCGTT GCAAAAGGCT TTGAGCGAGC GCATACCCGA GGCGAGCTTC
GCCTTCGTCG ACGCCCCCGG CACCAACGAC GTGAGGGCGT ACGTCAAAGA GCAGGCTGAA
CGCAACGACC TACCCGATAT GGTTTTCAGC GGCATGCGCG TGGGGTCGGG CGAATACGCG
TACGATCTGT CCGCGGAAGG GTTCACGGGC CGCTACAACC TTTCGGCGCT TGAAAAGCTG
AGCGTTGACG GAGCTTTGCG GCAACTGCCC ATCAACAGCT CCGTCAAAGG CATTTTCTAC
AACAAAACGC TTTTCGAGGA GCACGGCTGG GAAATCCCGA CAACCCTTGA TGAGTTCTAC
GACCTTTGCG ATGCCATCAC AGCCGAAGGC ATCCGTCCCT TCGTCCCTTG CCTGAAATAT
TCCGTGCAAG ACGTTGGTTT GGGTCTGACC AGCCGAGAAG TGTTCGGCAC GTCGGAAAAG
CGCGCCCGGT ACGATGACGT GGTCAACAAA GAGGCTTCCT GCGAAGGACT GCTTGAACCG
TACTACGAGA CGCTCAAGCA GCTGTACGAT CGAGGGATCG TGGTGGAAAG CGATTTCACG
TCGAGCCTTA CCCAAAACCG TCAGGCCATG TATGCCGGAG AGATTGCGAT GATCCCCAGC
GATCTGTCGA TGTACAGTCT GTACGAGCAG GAAAAACCCG GTTGCGAGAT CGACTTCATA
GGGTTTCCGA CCGACACGCC CAACGAGCGA TGGATGCAGA TGTCTCTGGG CGTGAACATG
ATGGCCTCCC AAAAGTCCAT GGAAGACCCG CAGAAGAAGC GCATCCTGCT TGACGCGCTG
GACTTCCTGA GCTCGGACGA GGGGCAGGCC GTCCTGTTCG AGTGCTTCAG CGGCATAAGC
AACGTCAAAT CGTACCAACA GAACATTCGA CCCGAGTTCT GGGACGTGAA GAACTGCCTT
GACGCAGGCT CCATTTACTT CGCCGACAGA GTCGGTATGA CGTCCGATTT CGAAACCGCA
TTCGAATGGA TGCGCGGCAA CATGACCATG CAGGAAATCA TAAAGGCGAC CGACGACTTC
GCACCCTGCA ATCTGTACGA ATCGATGGAG ACCCCCGTCA TCGGAAAAGC GGCCGAGGAT
TTCACCGTGC TGGAGACCAG CAACCTTATA GCGGACGCCA TGCGCGACGC CTCCGGTGCC
GACGTAGCGC TGCTGATCAA CAACTACTAT TACAAAGGGA ATTCCGGAAA GCTCTATCAA
GGGGACATCT CCCTGGCCGA CCGCTTCAAC CTGCGCAGCG TCACGACCGA CGATGTCCTG
ACGACATACG AGATCAGCGG AACGGATCTG AAAAAGCTCA TGGAGCATCC GAAGATCGGC
GGCGAAGAGA TCAACGCGAC GTACGCGCTC TCGGGTCTCA AGATGGAATA CACGCCCTGG
CGCGCCGCCG ACCAGAACGT GCTGAGCCTG ACGTTGCCCG ACGGGACCGA GATCTCCGAC
GACGCGCAGT ACACGGTGGC CGCCTGGGCC GGGTCGATCG ACGAGTCGTA CATCGGATCC
GTCTTGGAAG CGCATGCGGA CGCGGGGACG AACGTCGATT TGATGACCGC GTACTTGGGC
CGCGTTGGCG AGGTTTCCCC TGCAAAAGAT GGGCGTATCA CGCTGATCTG GGACTGA
 
Protein sequence
MKTRARTFIA ALTVMATMAG LTSCAHSQPE PGYPICTENE IVTYEPTDTT KTVITIGRYT 
IFNSEPLQKA LSERIPEASF AFVDAPGTND VRAYVKEQAE RNDLPDMVFS GMRVGSGEYA
YDLSAEGFTG RYNLSALEKL SVDGALRQLP INSSVKGIFY NKTLFEEHGW EIPTTLDEFY
DLCDAITAEG IRPFVPCLKY SVQDVGLGLT SREVFGTSEK RARYDDVVNK EASCEGLLEP
YYETLKQLYD RGIVVESDFT SSLTQNRQAM YAGEIAMIPS DLSMYSLYEQ EKPGCEIDFI
GFPTDTPNER WMQMSLGVNM MASQKSMEDP QKKRILLDAL DFLSSDEGQA VLFECFSGIS
NVKSYQQNIR PEFWDVKNCL DAGSIYFADR VGMTSDFETA FEWMRGNMTM QEIIKATDDF
APCNLYESME TPVIGKAAED FTVLETSNLI ADAMRDASGA DVALLINNYY YKGNSGKLYQ
GDISLADRFN LRSVTTDDVL TTYEISGTDL KKLMEHPKIG GEEINATYAL SGLKMEYTPW
RAADQNVLSL TLPDGTEISD DAQYTVAAWA GSIDESYIGS VLEAHADAGT NVDLMTAYLG
RVGEVSPAKD GRITLIWD