Gene SeD_A2363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2363 
Symbol 
ID6871099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2237147 
End bp2238202 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content55% 
IMG OID642785455 
Productcobalamin biosynthesis protein CbiG 
Protein accessionYP_002216113 
Protein GI198242524 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2073] Cobalamin biosynthesis protein CbiG 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.518108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.79543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACCG TAAAGCCTGA ATCCATTGCG TTGTTTTGCC TGACGCCCGG CGGCGTGGCG 
CTGGCAAAGA GACTCTCCGC GATGCTGCCG TTAACCTGCT TTACCAGTGA AAAACTGCGG
GAAGAGGGAT TTATTCCCTT CGATGGCGGA TTCGCTAATA CCGCCCGGCA GGCTTTTACC
ACTTATACCG CGCTTATTTT TATCGGCGCG ACCGGCATTG CCGTTCGTGT CCTGGCGCCG
TTAGTGAACG ACAAGTTCAG CGATCCGGCG GTGGTCGTCA TTGATGAACG AGGTCAGCAT
GTCATTAGCC TGCTTTCCGG TCATGCGGGC GGGGCCAATG CCTTGACGCG CTACCTGGCA
GGAATGTTAG GCGCCGATCC GGTGATTACC ACGGCAACGG ATGTCAATGA GATGTCCGCG
TTGGACACCT TAGCTTTCCA GCTTAACGCC CGCATGTCCG ATCTTCGCAC GGCGGTAAAA
ACCGTTAACC AGATGTTGGT CAGTCATCAA CGTGTGGGGT TATGGTGGGA TGCCGAACTA
ACGGAAGAGA TCGGCCAGTG CGATATTCGC GGTTTTATCC CTGTTGATGA TTTGCAGCGG
TTGCCTGAGC TGGATGCGCT TATCTGCGTC TCTTTGCGTA ATGACCTCCC TGAGCTTCCC
GTACTGCACT GGAAACTGGT GCCCCAGCGT GTGGTGGCGG GAATTGGCTG TCGCCGCGAT
ACGCCATTTC CCCTGTTAGC GACATTACTG GCGCGTCAGC TTGAAGCGCA GAAACTCGAT
CCGCTGGCGT TAAAAGCGAT TGGCAGCGTC ACGCTCAAAA AAGGGGAGCC GGGGCTTATT
CAGCTCGCCT CCTGCTGCCG CGTGCCTTTT AAAACCTTTA CCGCCGAAGC GTTGCGTGAA
TTCGAACACC ATTTTCCCGG TTCTGGCTTC GTCAGAAAAA CGGTGGGCGT TGGCAGCGTA
TCCGGCCCGG CAGCGTGGTT ATTAAGCCAG GGACAATTGT TAGGCGAGAC CCTGCGAGAA
CAGGGCGTCA CTATTACTTT GGGAATTTCA CACTGA
 
Protein sequence
MNTVKPESIA LFCLTPGGVA LAKRLSAMLP LTCFTSEKLR EEGFIPFDGG FANTARQAFT 
TYTALIFIGA TGIAVRVLAP LVNDKFSDPA VVVIDERGQH VISLLSGHAG GANALTRYLA
GMLGADPVIT TATDVNEMSA LDTLAFQLNA RMSDLRTAVK TVNQMLVSHQ RVGLWWDAEL
TEEIGQCDIR GFIPVDDLQR LPELDALICV SLRNDLPELP VLHWKLVPQR VVAGIGCRRD
TPFPLLATLL ARQLEAQKLD PLALKAIGSV TLKKGEPGLI QLASCCRVPF KTFTAEALRE
FEHHFPGSGF VRKTVGVGSV SGPAAWLLSQ GQLLGETLRE QGVTITLGIS H