Gene Elen_1514 details


Gene Information

Locus tag: Elen_1514
Symbol:
ID: 8415812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1805016
End bp: 1806395
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 65%
IMG OID: 645024482
Product: transposase IS204/IS1001/IS1096/IS1165 family protein
Protein accession: YP_003181871
Protein GI: 257791265
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
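
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below reproduces that check in Python. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1805016, 1806395)
print(length)                  # 1380
print(protein_length(length))  # 459
```

Both results match the Gene Length and Protein Length fields of this record.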


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.623533
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGCGG CGGGCTACGT CGAGGAGCTG CTCGGCGTCC GGGGCTGCGA GCTCATCGGC 
AGTGCGACCA AGGTCGATTG CGGCAGGAGG CTCCAGCAGT TCGACCTGAG ATACCGCGGG
CCCGTCCCCG CATCGTGCCC CGAGTGCGGC GGGACGCTGC ACAGCCACGG CGCGAGGACC
GTCGGCGTTG TGTCGACGCC ACACTTGGGC ATCCCGACGA GGCTCGAGAT AGGGTTCCCG
CGGATGCGAT GCCCGGAGTG CGGCTACGTG TGGCGCCCGG CGATAGGCGG GGTCGACGCG
GGTCACCGAA TGACGGAGGC GGCATACGCC GACATCGCCC AGCGCTCACT CAGGCTCACC
TTCCGCGAGG TCGCCGAGGA GTACCCGCTC TCGCACGTCA CCGTGAAGAA CGTCTTCGAG
GACTACGTCC GCGAGAACGC CTCGAGGCTT CGCTTCAAGG TGCCGGCGTT CCTGGGCATA
GACGAGAAGA ACCTCAAGAG GGTCGGCATG GTGACCGTGA TCACCGACCT CGAGCACAGG
ACGGTCTTCG ACATGGTGCC CGGCAGGACG CAGTCCGACC TCGACGCCTA TTTCTCCTCG
TTGGAGGGCC TCGAGCGCGT ACGATGGGTC TCGAGCGACA TGTACCGGCC GTTCTGGAGG
AGCATAGCCA AGTACACCCC GAACGCGACA TGGGTCATCG ACCACTTCCA TGTCGTGAGG
GGAGCCAACG AGGCCTTAGA CGCGGTCCGC AAGGGCCTCC AGGGAGCCCT CGACAGGAAG
GGCCGCCTTG AGCTAAAGAA GGGCCTCGCC TACGCCCTCA GGAAGCGGAC ACGCGACCTC
AGCCCCTACG AGGCGTCGGC CCTAAGGGCG CTACGGGAAG ACCCCTCCTA CTCGACGCTG
ATGACCGCGT ACGACCTCAA GGAGGACTTC TTCGGCATTT ACGACGACCA TCCGTCCTCA
CGCGAGGAGG CTGAGGCGGC GTTCGACGCC TGGATTCGGG AAATTCCCGA TGGCAGGGAG
TTCGATCCGT TCAGAGCCCT TGCTCGGACC GTCCAGAACC ACCGCGAGTT CATCTTCAAC
TACTGGGAAT GCCCCAGCCG CATCTCGAAC GGCTACACCG AGTGCGCGAA CCGGCTCATC
AACGAAACGG ACATGAGAGG GCGCGGATAC TCGTTCGAGA CGCTTCGGGC AAGGACCCTC
TACCGCAGGC AGAACCTCGA CCGCATCATC GCGAGCAACG GGCTCACGAT CGGCCCTCGC
ATCGATGCTC CCGGCCCGCT CTTCGTGACC GAGCCCGACC GCGAGGACGA GGCCGTGGAC
GAGTTCATAG ACCCGAGGTC GGGAGTGAAG GTCGACGCAA CGACTGGGGA GGTCCATTAA
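
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal Python sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = clean_sequence("ATGGGCGCGG CGGGCTACGT")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.75
```

Applied to the full sequence, the same two functions give values that can be compared with the Gene Length and GC content fields of the record.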
 
Protein sequence
MGAAGYVEEL LGVRGCELIG SATKVDCGRR LQQFDLRYRG PVPASCPECG GTLHSHGART 
VGVVSTPHLG IPTRLEIGFP RMRCPECGYV WRPAIGGVDA GHRMTEAAYA DIAQRSLRLT
FREVAEEYPL SHVTVKNVFE DYVRENASRL RFKVPAFLGI DEKNLKRVGM VTVITDLEHR
TVFDMVPGRT QSDLDAYFSS LEGLERVRWV SSDMYRPFWR SIAKYTPNAT WVIDHFHVVR
GANEALDAVR KGLQGALDRK GRLELKKGLA YALRKRTRDL SPYEASALRA LREDPSYSTL
MTAYDLKEDF FGIYDDHPSS REEAEAAFDA WIREIPDGRE FDPFRALART VQNHREFIFN
YWECPSRISN GYTECANRLI NETDMRGRGY SFETLRARTL YRRQNLDRII ASNGLTIGPR
IDAPGPLFVT EPDREDEAVD EFIDPRSGVK VDATTGEVH
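
The protein sequence corresponds to the gene sequence translated with translation table 11. A short Python sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acid to every codon as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and the inputs are short in-frame excerpts from the start and end of the gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Translation table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGCGCGG CGGGCTACGT CGAGGAGCTG"))  # MGAAGYVEEL
print(translate("ACTGGGGAGG TCCATTAA"))               # TGEVH
```

The two outputs match the first ten and last five residues of the protein sequence listed above.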