Gene SeD_A1959 details


Gene Information

Locus tag: SeD_A1959
Symbol:
ID: 6874159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1888594
End bp: 1891656
Gene Length: 3063 bp
Protein Length: 1020 aa
Translation table: 11
GC content: 58%
IMG OID: 642785079
Product: molydopterin dinucleotide domain-containing protein
Protein accession: YP_002215745
Protein GI: 198245236
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
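
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1888594, 1891656)
print(length)                     # 3063
print(protein_length_aa(length))  # 1020
```

Both results agree with the Gene Length and Protein Length values listed in the record.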


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.00000210512
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTAATT TAACCCGTCG TCAGTGGCTA AAAGTCGGTC TCGCCGTCGG TGGGATGGTC 
ACTTTTGGTC TGAGCTACCG TGATGTGGCG AAACGCGCAA TTGATGGCCT GTTAAACGGG
ACATCCGGCA AGGTAACGCG CGACCGCATC TTTGGCAATG CGTTAATTCC GGAGGCGCAG
GCGCAAACGC ACTGGCAGCA AAATCCACAA CAAACCATCG CCATGACGCA ATGCTTCGGC
TGTTGGACAC AGTGCGGTAT CCGCGCCCGG GTTAATGCCG ATGGCAAAGT GATACGCATC
GCCGGCAATC CCTATCACCC CTTGTCGCAG GAACACCCGA TTGACTCGTC CGTCCCTTTT
AGCGAAGCCA TGGAGCAACT GGCGGGAGAA AGCGGTCTTG ACGCCCGCTC AACCGCCTGC
GCGCGCGGCG CCACGCTGCT GGAAAGCCTG TACAGTCCGC TACGACTGCT TGAACCGATG
AAACGCGTGG GTAAACGCGG CGAAGGGAAA TGGCAGCGCA TCAGCTTTGA GCAACTTATT
GAAGAAGTCG TGGAAGGCGG CGATCTGTTT GGCGAAGGTC ATGTGGACGG ACTGCGCGCT
ATTCATGCGC CGGATACGCC AATTGACGCA AAGCACCCCA GTTTCGGGCC CAAAACCAAT
CAGTTACTGG TCACGAATAC CAGCGACGAA GGCCGCGATG CGTTTCTGCG TCGTTTTGCG
CTAAATAGCT TCGGCAGCAA GAATTTCGGC GCGCATGGCG CCTACTGTGG ACTGGCTTAC
CGGGCCGGCT CCGGGGCATT GATGGGCGAT CTGGATAAAA ACCCGCATGT CAAACCCGAC
TGGGAAAACG TGGAGTTTGC GCTCTTTATG GGCACCTCCC CGGCACAGTC CAGCAATCCG
TTTAAACGCC AGGCACGTCA GTTGGCGAGC GCCCGACTGC GTGAGAATTT TCAATACGTC
GTGGTTGCCC CCGCCCTCCC CTTATCAACG GTGCTCGCCG ATCCTCGCGG TCGCTGGCAA
CCGGTCATGC CCGGCAGCGA TTCGGCGCTG GCAATGGGGA TGATCCGCTG GATCATGGAT
AATGAACGTT ATAATGCTGA TTATCTGGCG ATTCCCGGCG TACAGGCTAT GCAGCAGGCC
GGCGAGCAAA GTTGGACCAA CGCCACGCAC CTGGTCATTG CGGATGAACT GCCGACGCTT
GCCGGACAAC ACCTGACGCT GCGCCATCTT ACGCCCGATG GCGAAGAGAC CCCTGTCGTG
CTGAATACCG ACGGCGAGTT GGTCGATGCG TCCACTTGCC GACAGGCACG GCTTTTCGTG
ACGCAGTACG TTACGCTCGC CGACGGCCAA CGGGTCACGG TGAAGAGCGG GTTGCAACGC
CTGAAAGAGG CGGCAGAAAA GCTCTCGTTG GCGCAATACA GCGAACAGTG CGGCGTACCG
GAAGCGCAAA TTATCGCGCT GGCGGAAACC TTTGCCAGTC ACGGACGTAA AGCCGCGGTC
ATCAGTCACG GCGGCATGAT GGCCGGCAAT GGGTTTTATA ACGCCTGGTC GGTCATGATG
CTTAACGCGC TGATCGGCAA CCTCAGCTTG TCCGGCGGCG TCTTTGTCGG CGGCGGCAAA
TTCAACGGCG TTAGCGACGG CCCCCGCTAC AACATGAACA GTTTTGCCGG AAAAGTGAAA
CCGTCCGGGT TAAGTATTGC CCGTAGCAAA ACCGCTTATG AAGCATCGGA AGAATACCGC
GACAAAATTG CCGGCGGGCA ATCCCCTTAT CCAGCCAAAG CGCCGTGGTA TCCCTTTGTG
GCAGGCCAGC TTACCGAACT GTTGACCTCC GCGCTCGAAG GCTATCCTTA TCCGCTTAAA
GCCTGGATTT CCAATATGAG CAACCCGTTT TACGGTGTTC CCGGTCTACG CGCCGTGGCG
GAAGAAAAGC TAAAAGACCC TCGCCGACTG CCGCTCTTTA TCGCGATTGA CGCCTTTATG
AATGAAACGA CGGCGCTGGC GGATTACATT GTGCCGGATA CGCACAATTT TGAGAGCTGG
GGCTTTACGG CGCCCTGGGG CGGCGTAGCC AGTAAAGCCA CTACCGCCCG CTGGCCGGTT
GTCGCCCCCG CCACTCGCCG AACGGTGGAC GGGCAACCTG TCTCAATGGA AGCATTTTGT
ATTGCGGTAG CAAAACGGCT CCATCTGCCC GGCTTCGGCG ACCGGGCGAT AACCGATCCG
CAGGGCAATA CTTTTCCACT GAACCGGGCG GAAGACTTCT ATCTGCGCGT AGCCGCTAAT
ATCGCCTTTA TGGGCAAGAC GCCGGTCGCG CTGGCAAATC AGGAAGATAT TTCGCTTACC
GGCGTCAGCC GCATTCTGCC AGCAATTCAG CACACGCTTA AAGCTGATGA GGTCGGTCGC
GTGGCGTTTA TCTACTCGCG TGGCGGCCGG TTTGCGCCCG AGGATAGCGG CTATACGGAG
CAACGGTTAG GTAACGCGTG GAAAAAACCC TTACAGATCT GGAATGCAGA TGTCGCCGCC
CACCGTCACG CCATCACCGG GGAGCGCTTC AGCGGTTGCC CGGTCTGGTA TCCGGCGCGT
TTGTCAGATG GTCGTGCGAT TGACGACCAG TTTCCCATTG GGCAATGGCC GCTGAAACTG
ATTTCATTTA AATCAAATAC CATGTCCAGC TCAACAGCCG TCATCCCGCG CTTACACCAT
GTGAAGCCAG CAAACCTGGT GGCGCTGAAT CCGCAAGACG GCGAGCGTTA TGGACTGCAA
CATGGCGATC GGGTACGGAT CATTACGCCG GGCGGTCAGG TCGTGGCGCA AATCAGTTTG
TTAAATGGCG TGATGCCAGG CGTCATCGCC ATCGAACACG GATATGGCCA CCGCGAGATG
GGCGCAACGC AGCACTCTCT GGATGGCGTG CCTATGCCGT ATGATCCACA AATCAGGGCA
GGCATAAATC TTAACGATCT GGGCTTTGCC GATCCGACAA GAACCATTAC CAACACCTGG
CTCGACTGGG TTTCTGGCGC GGCAGTACGT CAGGGGCTGC CGGCAAAAAT CGAGCGTATA
TAA
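
The GC content field in the Gene Information section can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base blocks and line breaks shown here. The function name gc_percent is hypothetical, and the reported 58% is presumably rounded to a whole percent.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence: 9 of 20 bases are G or C.
print(gc_percent("ATGGCTAATT TAACCCGTCG"))  # 45.0
```

Passing the full 3063 bp sequence to the same function gives the value to compare against the GC content field.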
 
Protein sequence
MANLTRRQWL KVGLAVGGMV TFGLSYRDVA KRAIDGLLNG TSGKVTRDRI FGNALIPEAQ 
AQTHWQQNPQ QTIAMTQCFG CWTQCGIRAR VNADGKVIRI AGNPYHPLSQ EHPIDSSVPF
SEAMEQLAGE SGLDARSTAC ARGATLLESL YSPLRLLEPM KRVGKRGEGK WQRISFEQLI
EEVVEGGDLF GEGHVDGLRA IHAPDTPIDA KHPSFGPKTN QLLVTNTSDE GRDAFLRRFA
LNSFGSKNFG AHGAYCGLAY RAGSGALMGD LDKNPHVKPD WENVEFALFM GTSPAQSSNP
FKRQARQLAS ARLRENFQYV VVAPALPLST VLADPRGRWQ PVMPGSDSAL AMGMIRWIMD
NERYNADYLA IPGVQAMQQA GEQSWTNATH LVIADELPTL AGQHLTLRHL TPDGEETPVV
LNTDGELVDA STCRQARLFV TQYVTLADGQ RVTVKSGLQR LKEAAEKLSL AQYSEQCGVP
EAQIIALAET FASHGRKAAV ISHGGMMAGN GFYNAWSVMM LNALIGNLSL SGGVFVGGGK
FNGVSDGPRY NMNSFAGKVK PSGLSIARSK TAYEASEEYR DKIAGGQSPY PAKAPWYPFV
AGQLTELLTS ALEGYPYPLK AWISNMSNPF YGVPGLRAVA EEKLKDPRRL PLFIAIDAFM
NETTALADYI VPDTHNFESW GFTAPWGGVA SKATTARWPV VAPATRRTVD GQPVSMEAFC
IAVAKRLHLP GFGDRAITDP QGNTFPLNRA EDFYLRVAAN IAFMGKTPVA LANQEDISLT
GVSRILPAIQ HTLKADEVGR VAFIYSRGGR FAPEDSGYTE QRLGNAWKKP LQIWNADVAA
HRHAITGERF SGCPVWYPAR LSDGRAIDDQ FPIGQWPLKL ISFKSNTMSS STAVIPRLHH
VKPANLVALN PQDGERYGLQ HGDRVRIITP GGQVVAQISL LNGVMPGVIA IEHGYGHREM
GATQHSLDGV PMPYDPQIRA GINLNDLGFA DPTRTITNTW LDWVSGAAVR QGLPAKIERI
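
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the gene sequence is given 5' to 3' in the coding direction. Translation table 11 assigns the same amino acid to each codon as the standard code and differs only in the permitted start codons, so a standard codon table is used here. The sketch treats every TGA as a stop and does not model selenocysteine recoding. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    cleaned = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate("ATGGCTAATT TAACCCGTCG T"))  # MANLTRR
```

The output matches the first seven residues of the protein sequence. Translating the full gene sequence the same way should give a string to compare against all 1020 residues.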