Gene Information |
Locus tag | SeD_A1959 |
Symbol | |
ID | 6874159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1888594 |
End bp | 1891656 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642785079 |
Product | molybdopterin dinucleotide domain-containing protein |
Protein accession | YP_002215745 |
Protein GI | 198245236 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
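
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAA). The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table in this record.
start_bp = 1888594
end_bp = 1891656
gene_length_bp = 3063
protein_length_aa = 1020


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Compare the derived values with the fields reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions, both comparisons agree with the Gene Length and Protein Length fields reported above.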
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00000210512 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTAATT TAACCCGTCG TCAGTGGCTA AAAGTCGGTC TCGCCGTCGG TGGGATGGTC ACTTTTGGTC TGAGCTACCG TGATGTGGCG AAACGCGCAA TTGATGGCCT GTTAAACGGG ACATCCGGCA AGGTAACGCG CGACCGCATC TTTGGCAATG CGTTAATTCC GGAGGCGCAG GCGCAAACGC ACTGGCAGCA AAATCCACAA CAAACCATCG CCATGACGCA ATGCTTCGGC TGTTGGACAC AGTGCGGTAT CCGCGCCCGG GTTAATGCCG ATGGCAAAGT GATACGCATC GCCGGCAATC CCTATCACCC CTTGTCGCAG GAACACCCGA TTGACTCGTC CGTCCCTTTT AGCGAAGCCA TGGAGCAACT GGCGGGAGAA AGCGGTCTTG ACGCCCGCTC AACCGCCTGC GCGCGCGGCG CCACGCTGCT GGAAAGCCTG TACAGTCCGC TACGACTGCT TGAACCGATG AAACGCGTGG GTAAACGCGG CGAAGGGAAA TGGCAGCGCA TCAGCTTTGA GCAACTTATT GAAGAAGTCG TGGAAGGCGG CGATCTGTTT GGCGAAGGTC ATGTGGACGG ACTGCGCGCT ATTCATGCGC CGGATACGCC AATTGACGCA AAGCACCCCA GTTTCGGGCC CAAAACCAAT CAGTTACTGG TCACGAATAC CAGCGACGAA GGCCGCGATG CGTTTCTGCG TCGTTTTGCG CTAAATAGCT TCGGCAGCAA GAATTTCGGC GCGCATGGCG CCTACTGTGG ACTGGCTTAC CGGGCCGGCT CCGGGGCATT GATGGGCGAT CTGGATAAAA ACCCGCATGT CAAACCCGAC TGGGAAAACG TGGAGTTTGC GCTCTTTATG GGCACCTCCC CGGCACAGTC CAGCAATCCG TTTAAACGCC AGGCACGTCA GTTGGCGAGC GCCCGACTGC GTGAGAATTT TCAATACGTC GTGGTTGCCC CCGCCCTCCC CTTATCAACG GTGCTCGCCG ATCCTCGCGG TCGCTGGCAA CCGGTCATGC CCGGCAGCGA TTCGGCGCTG GCAATGGGGA TGATCCGCTG GATCATGGAT AATGAACGTT ATAATGCTGA TTATCTGGCG ATTCCCGGCG TACAGGCTAT GCAGCAGGCC GGCGAGCAAA GTTGGACCAA CGCCACGCAC CTGGTCATTG CGGATGAACT GCCGACGCTT GCCGGACAAC ACCTGACGCT GCGCCATCTT ACGCCCGATG GCGAAGAGAC CCCTGTCGTG CTGAATACCG ACGGCGAGTT GGTCGATGCG TCCACTTGCC GACAGGCACG GCTTTTCGTG ACGCAGTACG TTACGCTCGC CGACGGCCAA CGGGTCACGG TGAAGAGCGG GTTGCAACGC CTGAAAGAGG CGGCAGAAAA GCTCTCGTTG GCGCAATACA GCGAACAGTG CGGCGTACCG GAAGCGCAAA TTATCGCGCT GGCGGAAACC TTTGCCAGTC ACGGACGTAA AGCCGCGGTC ATCAGTCACG GCGGCATGAT GGCCGGCAAT GGGTTTTATA ACGCCTGGTC GGTCATGATG CTTAACGCGC TGATCGGCAA CCTCAGCTTG TCCGGCGGCG TCTTTGTCGG CGGCGGCAAA TTCAACGGCG TTAGCGACGG CCCCCGCTAC AACATGAACA GTTTTGCCGG AAAAGTGAAA CCGTCCGGGT TAAGTATTGC CCGTAGCAAA ACCGCTTATG AAGCATCGGA AGAATACCGC GACAAAATTG CCGGCGGGCA ATCCCCTTAT CCAGCCAAAG CGCCGTGGTA TCCCTTTGTG 
GCAGGCCAGC TTACCGAACT GTTGACCTCC GCGCTCGAAG GCTATCCTTA TCCGCTTAAA GCCTGGATTT CCAATATGAG CAACCCGTTT TACGGTGTTC CCGGTCTACG CGCCGTGGCG GAAGAAAAGC TAAAAGACCC TCGCCGACTG CCGCTCTTTA TCGCGATTGA CGCCTTTATG AATGAAACGA CGGCGCTGGC GGATTACATT GTGCCGGATA CGCACAATTT TGAGAGCTGG GGCTTTACGG CGCCCTGGGG CGGCGTAGCC AGTAAAGCCA CTACCGCCCG CTGGCCGGTT GTCGCCCCCG CCACTCGCCG AACGGTGGAC GGGCAACCTG TCTCAATGGA AGCATTTTGT ATTGCGGTAG CAAAACGGCT CCATCTGCCC GGCTTCGGCG ACCGGGCGAT AACCGATCCG CAGGGCAATA CTTTTCCACT GAACCGGGCG GAAGACTTCT ATCTGCGCGT AGCCGCTAAT ATCGCCTTTA TGGGCAAGAC GCCGGTCGCG CTGGCAAATC AGGAAGATAT TTCGCTTACC GGCGTCAGCC GCATTCTGCC AGCAATTCAG CACACGCTTA AAGCTGATGA GGTCGGTCGC GTGGCGTTTA TCTACTCGCG TGGCGGCCGG TTTGCGCCCG AGGATAGCGG CTATACGGAG CAACGGTTAG GTAACGCGTG GAAAAAACCC TTACAGATCT GGAATGCAGA TGTCGCCGCC CACCGTCACG CCATCACCGG GGAGCGCTTC AGCGGTTGCC CGGTCTGGTA TCCGGCGCGT TTGTCAGATG GTCGTGCGAT TGACGACCAG TTTCCCATTG GGCAATGGCC GCTGAAACTG ATTTCATTTA AATCAAATAC CATGTCCAGC TCAACAGCCG TCATCCCGCG CTTACACCAT GTGAAGCCAG CAAACCTGGT GGCGCTGAAT CCGCAAGACG GCGAGCGTTA TGGACTGCAA CATGGCGATC GGGTACGGAT CATTACGCCG GGCGGTCAGG TCGTGGCGCA AATCAGTTTG TTAAATGGCG TGATGCCAGG CGTCATCGCC ATCGAACACG GATATGGCCA CCGCGAGATG GGCGCAACGC AGCACTCTCT GGATGGCGTG CCTATGCCGT ATGATCCACA AATCAGGGCA GGCATAAATC TTAACGATCT GGGCTTTGCC GATCCGACAA GAACCATTAC CAACACCTGG CTCGACTGGG TTTCTGGCGC GGCAGTACGT CAGGGGCTGC CGGCAAAAAT CGAGCGTATA TAA
|
Protein sequence | MANLTRRQWL KVGLAVGGMV TFGLSYRDVA KRAIDGLLNG TSGKVTRDRI FGNALIPEAQ AQTHWQQNPQ QTIAMTQCFG CWTQCGIRAR VNADGKVIRI AGNPYHPLSQ EHPIDSSVPF SEAMEQLAGE SGLDARSTAC ARGATLLESL YSPLRLLEPM KRVGKRGEGK WQRISFEQLI EEVVEGGDLF GEGHVDGLRA IHAPDTPIDA KHPSFGPKTN QLLVTNTSDE GRDAFLRRFA LNSFGSKNFG AHGAYCGLAY RAGSGALMGD LDKNPHVKPD WENVEFALFM GTSPAQSSNP FKRQARQLAS ARLRENFQYV VVAPALPLST VLADPRGRWQ PVMPGSDSAL AMGMIRWIMD NERYNADYLA IPGVQAMQQA GEQSWTNATH LVIADELPTL AGQHLTLRHL TPDGEETPVV LNTDGELVDA STCRQARLFV TQYVTLADGQ RVTVKSGLQR LKEAAEKLSL AQYSEQCGVP EAQIIALAET FASHGRKAAV ISHGGMMAGN GFYNAWSVMM LNALIGNLSL SGGVFVGGGK FNGVSDGPRY NMNSFAGKVK PSGLSIARSK TAYEASEEYR DKIAGGQSPY PAKAPWYPFV AGQLTELLTS ALEGYPYPLK AWISNMSNPF YGVPGLRAVA EEKLKDPRRL PLFIAIDAFM NETTALADYI VPDTHNFESW GFTAPWGGVA SKATTARWPV VAPATRRTVD GQPVSMEAFC IAVAKRLHLP GFGDRAITDP QGNTFPLNRA EDFYLRVAAN IAFMGKTPVA LANQEDISLT GVSRILPAIQ HTLKADEVGR VAFIYSRGGR FAPEDSGYTE QRLGNAWKKP LQIWNADVAA HRHAITGERF SGCPVWYPAR LSDGRAIDDQ FPIGQWPLKL ISFKSNTMSS STAVIPRLHH VKPANLVALN PQDGERYGLQ HGDRVRIITP GGQVVAQISL LNGVMPGVIA IEHGYGHREM GATQHSLDGV PMPYDPQIRA GINLNDLGFA DPTRTITNTW LDWVSGAAVR QGLPAKIERI
|