Gene SeHA_C3491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3491 
Symbol 
ID6490816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3387557 
End bp3388579 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content53% 
IMG OID642743620 
Productphage major capsid protein, P2 family 
Protein accessionYP_002047234 
Protein GI194451187 
COG category 
COG ID 
TIGRFAM ID[TIGR01551] phage major capsid protein, P2 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value1.35706e-17 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACCTTA ATAACCGTGC GCGGGAATTA CTGGACGGAT ATTCGGCGGG CATGGCGCAG 
CAGTTTGGGG CGCGTGATGC CAGTCGTTAT TTTTCCCTGA ATAACCCGCA GGAAAATGCG
CTGCGTCTTG CGCTGCTGGA ATCCGTCGAA TTCCTGGACA TGCTTACCTG TCTGGATGTT
GATCAGCTGA GTGGCCAGGT GATTTCCGTT GGTTCTTCCG TATTACACAC AGGACGTAGT
GAAAGTGGCC GTTTTATTCG CCAGGTTGGT GTGGACGGAA ACGACTATTC ACTGGTGGAA
ACAGACAGCT GCGCCGCGTT GCGCTGGGAT CTGCTTTCGG TCTGGGCAAA CGCCGGTAAA
GATGAAAACG AGTTTTACAA CCTTGTCCAG GCATTTACCA CGCAGGCTTT TGCACTGGAT
ATGTTGCGTA TCGGCTTTAA CGGTAAGAGC CGCGCAAAAA CCACTGATCC CGAAGCTAAC
CCGAACGGTG AAGATGTGAA TATCGGCTGG CATGAGCGCA TGAAAACGCT GCTGGGCGGC
AATCAGATTA TGACCGATCC GGTGGTGCTG GATGCAGCCG GGGATTACAA ATCACTGGAT
GCAATGGCGT CAGACCTGAT TAACGCCAAA ATTCCGGCGC AGTTCCGCAA TGACCCGCGT
CTGGTGGTTC TGGTAGGGGC TGATCTGGTT GCAGCTGAAC AGTATCGCCT GTATCAGGCC
GCAGACCGTC CGACTGAAAA AATCGCAGCG CAGTTGCTGG GGAATACCAT TGCTGGCCGT
CCGGCCATTA TCCCGCCTTT TATGCCGGGA AAACGCATGG TGGTGACGCC GCTGAAAAAT
CTGCACATCT ATACCCAGCG CAATACCCGT ATGCGTAAGG CGGAGTTTGT GGAAGACCGT
AAGCAGTTTG AAAACAAATA CCTGCGCAAT GAAGGATATG CGGTGGAAGT GCCGGAACTG
TATGCGGCCA TTGATGAATC CGCCGTAACT ATCGGCAAGG TTTCCGAACC AGCGGAGGGC
TGA
 
Protein sequence
MHLNNRAREL LDGYSAGMAQ QFGARDASRY FSLNNPQENA LRLALLESVE FLDMLTCLDV 
DQLSGQVISV GSSVLHTGRS ESGRFIRQVG VDGNDYSLVE TDSCAALRWD LLSVWANAGK
DENEFYNLVQ AFTTQAFALD MLRIGFNGKS RAKTTDPEAN PNGEDVNIGW HERMKTLLGG
NQIMTDPVVL DAAGDYKSLD AMASDLINAK IPAQFRNDPR LVVLVGADLV AAEQYRLYQA
ADRPTEKIAA QLLGNTIAGR PAIIPPFMPG KRMVVTPLKN LHIYTQRNTR MRKAEFVEDR
KQFENKYLRN EGYAVEVPEL YAAIDESAVT IGKVSEPAEG