Gene SeD_A2439 details

Gene Information

Locus tag: SeD_A2439
Symbol: wcaM
ID: 6872890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2311681
End bp: 2313078
Gene length: 1398 bp
Protein length: 465 aa
Translation table: 11
GC content: 51%
IMG OID: 642785528
Product: putative colanic acid biosynthesis protein
Protein accession: YP_002216186
Protein GI: 198242949
COG category:
COG ID:
TIGRFAM ID:
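
The coordinate and length fields above are consistent with one another, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2311681
end_bp = 2313078

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)  # 1398 466 465
```

Under these assumptions the computed values match the reported gene length (1398 bp) and protein length (465 aa).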


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.71935
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0000506464
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAATAAGT TCTCCCGACG TACCCTCCTG ACGGCAGGTT CCGCGCTTGC TGTTCTTCCT 
TTTCTGCGCG CCTTGCCGGT ACAGGCGCGT GAACCTCGCG AGACCGTCGA TATTAAGGAT
TATCCGGCGG ATGACGGTAT CGCCTCGTTC AAACAGGCCT TCGCCGACGG ACAGACCGTG
GTCGTACCGC CAGGATGGGT GTGTGAAAAT ATCAATGCGG CGATAACGAT TCCGGCGGGA
AAAACGCTGC GGGTACAGGG CGCGGTGCGT GGGAATGGCC GGGGACGGTT TATTTTGCAG
GACGGGTGTC AGGTGGTGGG GGAGCAGGGC GGCAGTCTGC ACAATGTGAC GCTGGATGTT
CGCGGGTCGG ACTGTGTGAT TAAAGGCGTG ACGATGAGCG GCTTTGGCCC CGTCGCGCAA
ATTTTCATCG GCGGTAAGGA ACCGCAGGTG ATGCGTAATC TCATTATCGA TGACATCACC
GTTACCCACG CCAACTACGC CATTCTCCGC CAGGGATTTC ATAACCAAAT GGACGGCGCG
CGGATTACGC ATAGCCGCTT TAGCGATTTG CAGGGGGACG CCATTGAGTG GAATGTCGCG
ATTCACGACC GCGACATCCT GATTTCCGAT CATGTCATCG AACGCATTGA TTGTACCAAT
GGCAAAATCA ACTGGGGGAT CGGCATCGGG CTGGCGGGTA GCACCTATGA CAACAGTTAT
CCTGAAGATC AGGCAGTAAA AAACTTTGTG GTGGCCAATA TTACCGGATC TGATTGCCGA
CAGCTGGTGC ACGTAGAAAA TGGCAAACAT TTCGTCATTC GCAATGTCAA AGCCAAAAAC
ATCACGCCCG ATTTCAGTAA AAATGCGGGT ATTGATAACG CAACGATCGC CATTTATGGC
TGTGATAATT TCGTCATTGA TAATATTGAT ATGACGAATA GTGCTGGGAT GCTCATCGGC
TATGGCGTCG TTAAAGGAAA ATACCTGTCA ATTCCGCAAA ACTTTAAATT AAACGCTATT
CGGTTGGATA ATCGCCAGGT TGCTTATAAA TTACGCGGCA TTCAAATTTC CTCCGGCAAC
ATCCCCTCTT TTGTCGCCAT CACCAATGTA CGGATGACGC GTGCTACGCT GGAACTGCAT
AATCAACCGC AGCACCTCTT TCTGCGTAAT ATCAACGTGA TGCAAACTTC AGCGATTGGC
CCGGCGTTAA AAATGCATTT CGATTTGCGT AAAGATGTCC GTGGTCAATT TATGGCCCGC
CAGGACACGC TGCTTTCCCT CGCTAATGTT CATGCCATCA ATGAAAACGG GCAGAGTTCC
GTGGATATCG ACAGGATTAA TCACCAAACC GTGAATGTCG AAGCAGTGAA TTTTTCGCTG
CCGAAGCGAG GAGGGTAA
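
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC fraction, which can be compared with the GC content reported in the record when run on the full sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above; paste the full sequence to
# compare against the reported GC content.
first_line = "GTGAATAAGT TCTCCCGACG TACCCTCCTG ACGGCAGGTT CCGCGCTTGC TGTTCTTCCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```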
 
Protein sequence
MNKFSRRTLL TAGSALAVLP FLRALPVQAR EPRETVDIKD YPADDGIASF KQAFADGQTV 
VVPPGWVCEN INAAITIPAG KTLRVQGAVR GNGRGRFILQ DGCQVVGEQG GSLHNVTLDV
RGSDCVIKGV TMSGFGPVAQ IFIGGKEPQV MRNLIIDDIT VTHANYAILR QGFHNQMDGA
RITHSRFSDL QGDAIEWNVA IHDRDILISD HVIERIDCTN GKINWGIGIG LAGSTYDNSY
PEDQAVKNFV VANITGSDCR QLVHVENGKH FVIRNVKAKN ITPDFSKNAG IDNATIAIYG
CDNFVIDNID MTNSAGMLIG YGVVKGKYLS IPQNFKLNAI RLDNRQVAYK LRGIQISSGN
IPSFVAITNV RMTRATLELH NQPQHLFLRN INVMQTSAIG PALKMHFDLR KDVRGQFMAR
QDTLLSLANV HAINENGQSS VDIDRINHQT VNVEAVNFSL PKRGG
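
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal pure-Python translator, not the method used to produce this record. It assumes the standard codon assignments (which translation table 11 shares with the standard code) and treats an initial ATG, GTG, or TTG as methionine, a simplification of the start-codon rules for bacterial genes. The function name `translate_cds` is hypothetical.

```python
# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate_cds(seq: str, start_codons=("ATG", "GTG", "TTG")) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        # Assumption: an alternative start codon at position 0 is read as M.
        if i == 0 and codon in start_codons:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence above.
print(translate_cds("GTGAATAAGT TCTCCCGACG TACCCTCCTG"))  # MNKFSRRTLL
print(translate_cds("CCGAAGCGAG GAGGGTAA"))               # PKRGG
```

The two outputs match the first ten and last five residues of the protein sequence listed above, including the GTG start codon being read as methionine.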