Gene Elen_2622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2622 
Symbol 
ID8416947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3046346 
End bp3048025 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content65% 
IMG OID645025600 
Productphage terminase, large subunit, PBSX family 
Protein accessionYP_003182962 
Protein GI257792356 
COG category[R] General function prediction only 
COG ID[COG1783] Phage terminase large subunit 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000146822 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCCAGA AGTTGACACA GAACCAAGAG CTGTATTGCC AAGCCCGCGC GAGGGGCCTG 
TCGCAGCGCC GCGCCTACAG GTCCGCGTAC CCGAAGTGCA ATTCGACCGA CGCGGCGGTA
GACGCGAAGG CGTGCAACCT CGAAAAACAA GCTAAGGTTT CGGCAAGGTT GCACGAGCTG
AACGAGGCCG GGGCGCGCGA CGCGAAGCTC ACGCGTGGGC GGCTCCTGCG CCGCCTCGAC
AGCCTAGCTG ATACCGCATG GGCGCGAGTA GCCGAGGACG CCGAAACGGG CCGCAGAATC
GACCCTGCGG CCTCGCACGC GCTCGTCGCC GCGTCCCGCG AGCTGTTGCC GTACGCCGAG
GACGACGCCA CGGTGCGCCC GCTGTTCGTC GCGGACTTCG GCCTGCTCAT ATCGCCCGAC
TTCTGCAAGC CGCACCGCAT GATCGCGCGA CGCGAGATAA CCGACGTGTG GCTCGGCGGC
GGGCGCGGCT CCATGAAGTC GTCCTATGCC TCGCTCGAAG TGGTCAACTA CATAGAGCAG
AACCCCGAGC AGCACGCGCT CGTCTTGATG AAGTACAAGA CGGCGATACG CGATGCCGCC
TATGCGCAGG TCGTGTGGGC GATCAAGATG CTCGGCCTTG AAGACGAGTA CGAAATGCCC
GATTCCACGC TGCGCATCAA GAAGCGCAGC ACGGGCCAGT TGATCATCTT CCGTGGCTGC
GACAACGCGC AGAAGATCAA ATCCATCAAG GTGCCGTTCG GCCATATCGG CGTCGCCTGG
TACGAGGAAG CCGACATGTT CAAGGGCATG GCGGAAATCC GCAAGGTGAA CCAATCGCTC
ACGCGCGGCG GCAACGACTG CATACGCCTC TACACGTACA ACCCGCCCCG CTCGGTGCAC
TCGTGGATCA ACGTCGAAAT GCAGCGCCGG CGCGATGCGG GCGAACCGGT GTTCACGTCG
AACTACCTCA ACGCCCCGCG CGAATGGCTC GGCGACCAGT TCCACGCAGA CGCCGAGGAA
TTGAAGCGTA TCGACCTCAA GGCATATTTG CACGAGTACA TGGGCGAGGC CGTCGGCATG
GGCCTCGAGG TGTTCGACCC CGAGAAGGTT GTATTCCGCG AGATAACGGA CGAGGAGATC
GCCGCCTTCG ACAACCTCAA GGCAGGCCAG GACTTCGGAT GGTACCCGGA CCCGTGGGCG
TTCACGCTGT CCGAGTGGCG GCAGAACACG CGTACACTGC TCACATTCCG GGAGGACGGC
GCGAACAAGC TGCACCCCGG CGAGCAGGCG AAGCGCATAC GTGCGCTGCT CACGTGGCGC
GACACGCCCG ACGGCGATCC CGTCTACCAC CATATCCCCG TGCGGTCGGA CGATGCCGCG
CCGGAGGCCA TAGCGGCGCA GAGGGACGCG GGGATCAACG CACGCGAGGC CGGCAAGGGC
AACATGCGCG ACGCGTCGTA TCGCTTCGTG CAATCGAGCA CGTGGGTCAT CGACCCCGTG
CGGTGCCCGA AGCTCGCGGC GGAGGTTCGG GCGATGCAGT ACGCGGTCAA CAAGGACGGC
GAAGTGCTCA ACGAGATACC CGACGGGAAC GACCACTGGG TAGACGCCGT GCGCTATTCG
CTCATGCCGA TAGTGCGCCG CGCGAGGGGC GCATACCGCG CGACGCCGGC CGAAGAATAG
 
Protein sequence
MAQKLTQNQE LYCQARARGL SQRRAYRSAY PKCNSTDAAV DAKACNLEKQ AKVSARLHEL 
NEAGARDAKL TRGRLLRRLD SLADTAWARV AEDAETGRRI DPAASHALVA ASRELLPYAE
DDATVRPLFV ADFGLLISPD FCKPHRMIAR REITDVWLGG GRGSMKSSYA SLEVVNYIEQ
NPEQHALVLM KYKTAIRDAA YAQVVWAIKM LGLEDEYEMP DSTLRIKKRS TGQLIIFRGC
DNAQKIKSIK VPFGHIGVAW YEEADMFKGM AEIRKVNQSL TRGGNDCIRL YTYNPPRSVH
SWINVEMQRR RDAGEPVFTS NYLNAPREWL GDQFHADAEE LKRIDLKAYL HEYMGEAVGM
GLEVFDPEKV VFREITDEEI AAFDNLKAGQ DFGWYPDPWA FTLSEWRQNT RTLLTFREDG
ANKLHPGEQA KRIRALLTWR DTPDGDPVYH HIPVRSDDAA PEAIAAQRDA GINAREAGKG
NMRDASYRFV QSSTWVIDPV RCPKLAAEVR AMQYAVNKDG EVLNEIPDGN DHWVDAVRYS
LMPIVRRARG AYRATPAEE