Gene Elen_1063 details

Gene Information

Locus tag: Elen_1063
Symbol:
ID: 8415353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1284889
End bp: 1286556
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 67%
IMG OID: 645024026
Product: von Willebrand factor type A
Protein accession: YP_003181423
Protein GI: 257790817
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
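
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1284889, 1286556)
print(length)                     # 1668
print(protein_length_aa(length))  # 555
```

Both results match the Gene Length (1668 bp) and Protein Length (555 aa) fields of the record.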


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0000708626
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCACGCTT CCAACCGCAG CCGCGGCAGG CTCGCCGCGG GCTCCATCGC CTTCGCCGTC 
CTGATCGCCG GCGCATCGCT TGCCGGATGC AGCCCCGACG GTCAGGCGGG CGACCAGCTA
GGATCGGCTG CAAGCGAATC GGAGATCATG GCCATCGGTT CCGCGCTGTC CGAGACGGCC
TCCACATGCC CTCCTCCCTA TCCGTACGTT CCCTCGCCCT CTCCCGGCGG CACCGAGGAA
TACCGTGCAC TCGACGAGCC GGGGTTCCTC TCCCCTGCGA CCAGCCCGCT GTCCACGCTG
TCGGCCGACG TCGACACGGC CTCGTACTGC AACCTGCGCC GCATGGTGGC GCAGAGATAC
GCGCCGGCCG TCGTGCCCGC CGGCGCCGTA CGCACCGAAG AGCTGCTCAA CTACTTCGAC
TACGCCTACC CGGAGCCCGT TGGCTCCGAC TTGTTCGGCG TATCGGCCCA GATGAGCGAC
TGTCCTTGGA ACGACCAGAC GAAGCTGCTG GTCATGGGAT TCGCCACCGA GAAGGACGGC
GACGCTTCGC CCACGGGCGC CAACCTCGTA TTCCTCATCG ACGTCTCGGG GTCGATGGAC
GACCCTGACA AGCTCCCCCT GGTCAAAGAC TCGTTCGCCG CGCTCGTCGA AGGGCTGACG
GAGCGCGACC GCGTGTCCGT CGTAACCTAC GCCAGCGGCG AGCGCGTGCT GCTCGAAGGC
GTGCCGGGCG ACGACAAGCG GCGTATCATG CGCGCCGTCG ACAGCCTCGT CGCCGAAGGG
TCGACGAACG GGGAAGCCGG TTTGGAGCAG GCGTACCGCC TGGCGGAATC GTCGTTCATC
GAAGGCGGTG TGAACCGCGT CGTCATGGCG TCGGACGGCG ACCTCAACGT GGGCATCTCG
TCCGAGAGCG AGCTGCACGA CTTCGTCGAG CAGAAGCGCG AGACCGGCGT GTACCTCTCG
GTGCTGGGAT TCGGCTCGGG CAACTACAAG GACAACAAGA TGGAGACGCT GGCCGACCAC
GGCAACGGCG CCTACCACTA CATCGACTGC GCCGAAGAAG CCCGACGGGT GCTCGGCCGG
AACCTCCGTG CGAACCTCGT GCCGCTTGCC GACGATGTGA AGATCCAGGT GGAATTCAAT
CCTGACCGGG TGAAGGGCTA TCGGCTGATC GGCTACGAGA ACCGCGCGCT CGCCGACGAG
GAGTTCCGCG ACGATGCGGG CGAGGTGGGC GCGGGCCATG CGTTCACCGT GGCGTACGAG
ATCGTCCCCG CAGGATCGGC GTTCGAGGTG GGCGCGTCCG CATCGAAATA CGGAAGCGAT
GCCGACGACC GGCAGGACGG TCGCCGCTCC GAAGCGAACG GCGGAGAATG GCTGACGTGC
ACGATGCGCT ACCGCCCTGC GGGAACCGTC GAAGCGGTGG AGCAGGCGCT GGTGGTCGAC
GATGAGAGCT GCACCGACGA TCCGAACGGA GATTGGACGT TCGCCGCCGC CGTCATCGAG
TGCGGCATGG CGCTGCACCG CTCGCCCCAT GCCGGCGCCG CCACCCTCGA AAGCGCCCGC
GACCTGCTGG CAAGCTGCGA GCTCACCGAC CAGCAGCAAG GCTTCGAAAC CCTCCTCGCC
GACCTCGCCC GCCAAGAGGG AGCGCACGGG TCATGCAACC GGTACTGA
 
Protein sequence
MHASNRSRGR LAAGSIAFAV LIAGASLAGC SPDGQAGDQL GSAASESEIM AIGSALSETA 
STCPPPYPYV PSPSPGGTEE YRALDEPGFL SPATSPLSTL SADVDTASYC NLRRMVAQRY
APAVVPAGAV RTEELLNYFD YAYPEPVGSD LFGVSAQMSD CPWNDQTKLL VMGFATEKDG
DASPTGANLV FLIDVSGSMD DPDKLPLVKD SFAALVEGLT ERDRVSVVTY ASGERVLLEG
VPGDDKRRIM RAVDSLVAEG STNGEAGLEQ AYRLAESSFI EGGVNRVVMA SDGDLNVGIS
SESELHDFVE QKRETGVYLS VLGFGSGNYK DNKMETLADH GNGAYHYIDC AEEARRVLGR
NLRANLVPLA DDVKIQVEFN PDRVKGYRLI GYENRALADE EFRDDAGEVG AGHAFTVAYE
IVPAGSAFEV GASASKYGSD ADDRQDGRRS EANGGEWLTC TMRYRPAGTV EAVEQALVVD
DESCTDDPNG DWTFAAAVIE CGMALHRSPH AGAATLESAR DLLASCELTD QQQGFETLLA
DLARQEGAHG SCNRY
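
The relationship between the gene and protein sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it strips the 10-base spacing used in the listing, translates with the amino acid assignments of translation table 11 (which match the standard code; table 11 differs in its permitted start codons, and that does not matter here because this gene starts with ATG), and computes a GC fraction. The helper names are hypothetical.

```python
# Codon table built in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCACGCTT CCAACCGCAG CCGCGGCAGG"
print(translate(first_blocks))  # MHASNRSRGR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein listing and the GC content field.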