Gene SeHA_C2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2999 
Symbol 
ID6491370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2939065 
End bp2940435 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content47% 
IMG OID642743155 
Productputative glycoporin 
Protein accessionYP_002046779 
Protein GI194448709 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.419579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.000157267 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGGCTA AATATTTGGC GCTGATGATC GGTGCTTGCT TTTCTCATAA CCTTTGGGCA 
GCGAATAATA TCACTATTGA GCAGCGTCTG GCTGAACTGG AGCAACGTGT TGTTAATGCT
GAAAAACGGG CATCCGATGC CGAGGCGCAA ATTCGCTCGT TGAAACAGCA GCAGGTCGCC
GCTACGCCGA TGGTGAATGT CCAGTCTGCC GAGCCCATTG CAGCAGGTAA AACACCGCCG
AAGCTGACCT TATCCGGATT CAGCGATATT AAGTTCTATG GCGATGTCGA ATTTAATATG
GATGCGGCAA GCCGTTCCGG TAGTCTGACA TCGACGAGAA CGTCAGCGAA TAAAGATTGG
GCACCGGGAA CCAATGAACG CTGGGATATT AACGGACGCC TGTTGCTGGG CTTTGATGGC
TACCAGCGGC TGGACAACGG TAATTTTGCC GGATTCTCTG TACAGCCTCT GGCGGACCTG
ACCGGAAAAA TGAACCTTGA TGATGCCGTT TTCTTCTTTG GTCGTGAGAA TGACTGGAAA
ATTAAGGTTG GTCGTTTTGA AGCCTACGAT ATGTTCCCAC TGAATCAGGA TACGTTTATT
GAATATTCGG GGAATACAGC GAACGATCTT TACAGTGACG GTTACGGCTA TATCTATATG
ATGAAAGAAG GACGGGGACG TAGCGACAGT GGGGGTAACT TCCTGCTGAG TAAAACCATC
GACAACTGGT ATTTCGAAGT TAACACATTG CTGGAAAATG GCAGTACGTT ATATACCGAG
AAGCAGTACC ACGGAATGGA TTTGAGCAAC GATAAAAATG TGGCTTACGT CCGTCCGGTT
ATCGCCTGGC AAAACGGGCG TTTTTCAACG GCGATAGCGA TGGAAAGTAA CGTCGTTAAC
AACGCCTATG GCTATTATGA GAATGGGAAG TGGATCGATC AGTCAGATCG TACGGGCTAT
GGTTTTACCA TGACCTGGAA TGGTCAAAAA ACTGACCCGG AAGATGGCGC AGTGATTAAC
CTGAATACCG CCTATATGGA TGCGACCGAT GAGACAGATT TTACCGCTGG GGTGAATGCG
CTGTGGCATC GATTTGAACT GGGTTATATC TATGCGCATA ACAAAATCGA AGCCTTCAAT
GCTACTAATA TCGATGCCGT TTGTGAGGAC GATTGCTGGA TCACCGATCC CGGCAATTAT
GATATTCACA CTATTCATGC CTCATATTTA TTCCCCAACG TGATGGATAT GAAAAACTTT
AACATCTACC TCGGTGCCTA TGCTTCATGG GTAGAGGCCA ATCCGAATAA TGGCGATAAC
AGTGAAGATG CGCGTTACGG CGGGCGTCTG AGATTCAAAT ATTTCTTCTG A
 
Protein sequence
MKAKYLALMI GACFSHNLWA ANNITIEQRL AELEQRVVNA EKRASDAEAQ IRSLKQQQVA 
ATPMVNVQSA EPIAAGKTPP KLTLSGFSDI KFYGDVEFNM DAASRSGSLT STRTSANKDW
APGTNERWDI NGRLLLGFDG YQRLDNGNFA GFSVQPLADL TGKMNLDDAV FFFGRENDWK
IKVGRFEAYD MFPLNQDTFI EYSGNTANDL YSDGYGYIYM MKEGRGRSDS GGNFLLSKTI
DNWYFEVNTL LENGSTLYTE KQYHGMDLSN DKNVAYVRPV IAWQNGRFST AIAMESNVVN
NAYGYYENGK WIDQSDRTGY GFTMTWNGQK TDPEDGAVIN LNTAYMDATD ETDFTAGVNA
LWHRFELGYI YAHNKIEAFN ATNIDAVCED DCWITDPGNY DIHTIHASYL FPNVMDMKNF
NIYLGAYASW VEANPNNGDN SEDARYGGRL RFKYFF