Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C2449 |
Symbol | |
ID | 6487767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 2358994 |
End bp | 2360082 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642742632 |
Product | cobalamin synthesis protein, P47K |
Protein accession | YP_002046267 |
Protein GI | 194451312 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.566188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0000000416664 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCAGAAG CACGACTTCG TCAACTCAGC ACTTCTCTTT TTCACGTTTA TTCGCCAGGA TTATTAAAAG TTGTTATGTT GTATCAATAT TCTGGAGCTG ACGTGACCAA AACCAATCTT ATTACTGGAT TTCTCGGTAG CGGAAAAACC ACCTCTATCC TTCATTTATT AGCTCATAAA GATCCGGCTG AAAAGTGGGC CGTCCTGGTT AATGAATTTG GTGAAGTGGG TATTGACGGC GCGCTGCTTG CCGATAGCGG CGCACTGCTA AAAGAGATCC CCGGCGGCTG CATGTGCTGC GTCAACGGAT TGCCTATGCA GGTGGGGCTC AACACGCTGC TGCGCCAGGG CAAACCTGAC CGGTTGCTGA TTGAACCAAC CGGACTGGGA CACCCAAAAC AGATTCTGGA TTTATTAACT GCGCCGGTTT ATGAGCCGTG GATTGATTTA CGCGCCACGC TCTGCATCCT TGACCCTCGT CTGCTACTGG ACCAACAGAG TGTCGCCAAT GAAAATTTCC GCGATCAGCT CGCCTCAGCC GATATTATCA TCGCCAATAA GACCGATCGC GCCACGGCGC AGAGCGATGC CGCCCTGCAA CAGTGGTGGC GACAGTACGG CGGCGATCGT CAACTGATTC ATGCCGAACA TGGACAGATA GACGATAAGC TTCTGGATTT ACCGCGACAA AATCTGGCGG AACTGCCGGC CAGCGCCGCG CATTCTCACA CTCATGCCAG TAAAAAAGGA CTCGCCGCGC TAAATCTGCC CGCACAGCAG CGCTGGCGAC GCAGCCTCAA TAGCGGACAG GGTCATCAGG CCTGCGGCTG GATTTTCGAT GCCGATACCG TGTTTGACAC CATTGGCCTC CTCGAATGGG CGCGTCTGGC GCCGGTGGGC CGGGTGAAAG GCGTTATGCG CATACAAGAG GGGCTGGTAC GCATCAATCG CCAGGGCGAT GACCTGCACA TCGAAACACA GAGTGTCGCG CCGCCGGATA GCCGGGTTGA ACTTATCTCA AACACAGAAA CCGACTGGAA TACGTTACAG ACGGCCTTGT TGAAGCTTCG TTTAGCGACG CACGCGTAA
|
Protein sequence | MAEARLRQLS TSLFHVYSPG LLKVVMLYQY SGADVTKTNL ITGFLGSGKT TSILHLLAHK DPAEKWAVLV NEFGEVGIDG ALLADSGALL KEIPGGCMCC VNGLPMQVGL NTLLRQGKPD RLLIEPTGLG HPKQILDLLT APVYEPWIDL RATLCILDPR LLLDQQSVAN ENFRDQLASA DIIIANKTDR ATAQSDAALQ QWWRQYGGDR QLIHAEHGQI DDKLLDLPRQ NLAELPASAA HSHTHASKKG LAALNLPAQQ RWRRSLNSGQ GHQACGWIFD ADTVFDTIGL LEWARLAPVG RVKGVMRIQE GLVRINRQGD DLHIETQSVA PPDSRVELIS NTETDWNTLQ TALLKLRLAT HA
|
| |