Gene SeD_A2563 details


Gene Information

Locus tag: SeD_A2563
Symbol:
ID: 6875019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2442035
End bp: 2443021
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 56%
IMG OID: 642785638
Product: cobalamin synthesis protein, P47K
Protein accession: YP_002216296
Protein GI: 198242709
COG category: [R] General function prediction only
COG ID: [COG0523] Putative GTPases (G3E family)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00261863
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.00000585707
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGACCAAAA CCAATCTTAT TACTGGATTT CTCGGTAGCG GAAAAACCAC CTCTATCCTT 
CATTTATTAG CTCATAAAGA TCCGGCTGAA AAGTGGGCCG TCCTGGTTAA TGAATTTGGT
GAAGTGGGTA TTGACGGCGC GCTGCTTGCC GACAGCGGCG CACTGCTAAA AGAGATCCCC
GGCGGCTGCA TGTGCTGCGT CAATGGATTG CCTATGCAGG TGGGGCTCAA CACGCTGCTG
CGCCAGGGCA AACCTGACCG GTTGCTGATT GAACCAACCG GACTGGGGCA CCCAAAACAG
ATTCTGGATT TATTAACTGC GCCGGTTTAT GAGCCGTGGA TTGATTTACG CGCCACGCTC
TGCATCCTGG ACCCTCGCCT GCTACTGGAC CAACAGAGCG TCGCCAATGA AAATTTCCGC
GATCAGCTCG CCTCAGCCGA TATTATCATC GCCAATAAAA CCGACCGCGC CACGGCGCAG
AGCGATGCCG CCCTGCAACA GTGGTGGCGA CAGTACGGCG GCGATCGTCA ACTGATTCAT
GCCGAACATG GACAGATAGA CGGTAAGCTT CTGGATTTAC CGCGACAAAA TCTGGCGGAA
CTGCCGGCCA GCGCCGCGCA TTCTCACACT CATGCCAGTA AAAAAGGACT CGCCGCGCTA
AATCTGCCCG CCCAGCAGCG CTGGCGACGC AGCTTCAATA GCGGACAGGG TCATCAGGCC
TGCGGCTGGA TTTTCGATGC CGATACCGTG TTTGACACCA TTGGCCTCCT CGAATGGGCG
CGTCTGGCGC CGGTGGGCCG GGTGAAAGGC GTTATGCGCA TACAAGAGGG GCTGGTACGC
ATCAATCGCC AGGGCGATGA CCTGCACATC GAAACACAGA GTGTCGCGCC GCCGGATAGC
CGGGTTGAAC TTATCTCAAA CACAGAAACC GACTGGAATA CGTTACAGAC GGCCTTGTTG
AAGCTTCGTT TAGCGACGCA CGCGTAA
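
The Gene Length and GC content fields of this record can be recomputed from the sequence block above. The following is a minimal Python sketch, not the database's own method: the function name `sequence_stats` is hypothetical, and it assumes the sequence is pasted as rows of space-separated 10-base groups, as shown here.

```python
def sequence_stats(block: str) -> tuple:
    """Return (length in bp, GC fraction) for a whitespace-formatted DNA block."""
    # Drop the spaces and line breaks used for display, normalise the case.
    seq = "".join(block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    # Count G and C bases, then express them as a fraction of all bases.
    gc = sum(seq.count(base) for base in "GC")
    return len(seq), gc / len(seq)


# Usage with the first row of the gene sequence only (illustrative input).
first_row = "GTGACCAAAA CCAATCTTAT TACTGGATTT CTCGGTAGCG GAAAAACCAC CTCTATCCTT"
length, gc_fraction = sequence_stats(first_row)
print(f"{length} bp, GC = {gc_fraction:.0%}")
```

Applied to the complete gene sequence, the length should come out as the 987 bp listed under Gene Length, and the GC fraction can be compared with the rounded 56% listed under GC content.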
 
Protein sequence
MTKTNLITGF LGSGKTTSIL HLLAHKDPAE KWAVLVNEFG EVGIDGALLA DSGALLKEIP 
GGCMCCVNGL PMQVGLNTLL RQGKPDRLLI EPTGLGHPKQ ILDLLTAPVY EPWIDLRATL
CILDPRLLLD QQSVANENFR DQLASADIII ANKTDRATAQ SDAALQQWWR QYGGDRQLIH
AEHGQIDGKL LDLPRQNLAE LPASAAHSHT HASKKGLAAL NLPAQQRWRR SFNSGQGHQA
CGWIFDADTV FDTIGLLEWA RLAPVGRVKG VMRIQEGLVR INRQGDDLHI ETQSVAPPDS
RVELISNTET DWNTLQTALL KLRLATHA
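
The protein sequence follows from the gene sequence by codon-wise translation: 987 bp is 329 codons, and dropping the final stop codon (TAA) leaves the 328 residues listed under Protein Length. The sketch below illustrates this in Python under stated assumptions. The name `translate_cds` is hypothetical, the codon assignments are those of the standard code (which translation table 11 shares), and the start-codon set is deliberately reduced to ATG, GTG and TTG, whereas table 11 permits further initiators. An initiator codon is read as M, which is why this gene's leading GTG yields methionine rather than valine.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(block: str) -> str:
    """Translate a whitespace-formatted coding sequence up to the first stop."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # The initiator codon is translated as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon: translation ends here
        protein.append(residue)
    return "".join(protein)


# Usage with the first 30 bases of the gene sequence.
print(translate_cds("GTGACCAAAA CCAATCTTAT TACTGGATTT"))  # MTKTNLITGF
```

The ten residues produced match the first block of the protein sequence above.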