Gene SeHA_C2449 details

Gene Information

Locus tag: SeHA_C2449
Symbol:
ID: 6487767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2358994
End bp: 2360082
Gene length: 1089 bp
Protein length: 362 aa
Translation table: 11
GC content: 54%
IMG OID: 642742632
Product: cobalamin synthesis protein, P47K
Protein accession: YP_002046267
Protein GI: 194451312
COG category: [R] General function prediction only
COG ID: [COG0523] Putative GTPases (G3E family)
TIGRFAM ID:
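
The start, end, gene length, and protein length fields above are mutually consistent, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


if __name__ == "__main__":
    length = gene_length_bp(2358994, 2360082)
    print(length)                     # 1089
    print(protein_length_aa(length))  # 362
```

Both values match the Gene length and Protein length fields of this record.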


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.566188
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0000000416664
Fosmid hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCAGAAG CACGACTTCG TCAACTCAGC ACTTCTCTTT TTCACGTTTA TTCGCCAGGA 
TTATTAAAAG TTGTTATGTT GTATCAATAT TCTGGAGCTG ACGTGACCAA AACCAATCTT
ATTACTGGAT TTCTCGGTAG CGGAAAAACC ACCTCTATCC TTCATTTATT AGCTCATAAA
GATCCGGCTG AAAAGTGGGC CGTCCTGGTT AATGAATTTG GTGAAGTGGG TATTGACGGC
GCGCTGCTTG CCGATAGCGG CGCACTGCTA AAAGAGATCC CCGGCGGCTG CATGTGCTGC
GTCAACGGAT TGCCTATGCA GGTGGGGCTC AACACGCTGC TGCGCCAGGG CAAACCTGAC
CGGTTGCTGA TTGAACCAAC CGGACTGGGA CACCCAAAAC AGATTCTGGA TTTATTAACT
GCGCCGGTTT ATGAGCCGTG GATTGATTTA CGCGCCACGC TCTGCATCCT TGACCCTCGT
CTGCTACTGG ACCAACAGAG TGTCGCCAAT GAAAATTTCC GCGATCAGCT CGCCTCAGCC
GATATTATCA TCGCCAATAA GACCGATCGC GCCACGGCGC AGAGCGATGC CGCCCTGCAA
CAGTGGTGGC GACAGTACGG CGGCGATCGT CAACTGATTC ATGCCGAACA TGGACAGATA
GACGATAAGC TTCTGGATTT ACCGCGACAA AATCTGGCGG AACTGCCGGC CAGCGCCGCG
CATTCTCACA CTCATGCCAG TAAAAAAGGA CTCGCCGCGC TAAATCTGCC CGCACAGCAG
CGCTGGCGAC GCAGCCTCAA TAGCGGACAG GGTCATCAGG CCTGCGGCTG GATTTTCGAT
GCCGATACCG TGTTTGACAC CATTGGCCTC CTCGAATGGG CGCGTCTGGC GCCGGTGGGC
CGGGTGAAAG GCGTTATGCG CATACAAGAG GGGCTGGTAC GCATCAATCG CCAGGGCGAT
GACCTGCACA TCGAAACACA GAGTGTCGCG CCGCCGGATA GCCGGGTTGA ACTTATCTCA
AACACAGAAA CCGACTGGAA TACGTTACAG ACGGCCTTGT TGAAGCTTCG TTTAGCGACG
CACGCGTAA
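
The gene sequence above is laid out in blocks of ten bases, so it has to be flattened before any statistic such as the GC content field can be recomputed. The following is a minimal sketch with hypothetical helper names (`clean_sequence`, `gc_percent`); the short input string is only the first few bases of the record, used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a flattened DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence, as formatted in the record.
    fragment = clean_sequence("ATGGCAGAAG CACGACTTCG")
    print(fragment)       # ATGGCAGAAGCACGACTTCG
    print(len(fragment))  # 20
```

Applying `gc_percent` to the full flattened sequence gives a value that can be compared with the GC content field.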
 
Protein sequence
MAEARLRQLS TSLFHVYSPG LLKVVMLYQY SGADVTKTNL ITGFLGSGKT TSILHLLAHK 
DPAEKWAVLV NEFGEVGIDG ALLADSGALL KEIPGGCMCC VNGLPMQVGL NTLLRQGKPD
RLLIEPTGLG HPKQILDLLT APVYEPWIDL RATLCILDPR LLLDQQSVAN ENFRDQLASA
DIIIANKTDR ATAQSDAALQ QWWRQYGGDR QLIHAEHGQI DDKLLDLPRQ NLAELPASAA
HSHTHASKKG LAALNLPAQQ RWRRSLNSGQ GHQACGWIFD ADTVFDTIGL LEWARLAPVG
RVKGVMRIQE GLVRINRQGD DLHIETQSVA PPDSRVELIS NTETDWNTLQ TALLKLRLAT
HA
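
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in the set of permitted start codons). The `translate` function is a hypothetical helper, not a library call, and it expects a flattened sequence with no spaces.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a flattened DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGGCAGAAGCACGA"))  # MAEAR (first five codons of the gene)
    print(translate("CACGCGTAA"))        # HA    (last three codons of the gene)
```

The two fragments correspond to the beginning (MAEAR...) and end (...HA) of the protein sequence listed above.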