Gene SeHA_C3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3559 
Symbol 
ID6491843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3449997 
End bp3452039 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content59% 
IMG OID642743682 
ProductLppC superfamily 
Protein accessionYP_002047296 
Protein GI194449109 
COG category[R] General function prediction only 
COG ID[COG3107] Putative lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACCCT CAACATTTTC TCGTTTGAAC GCCGCGCGCG CGCTGCCTGT CGTCCTGGCT 
GCGCTACTTT TCGCCGGGTG CGGCACCCAG GCGCCAGATC AAAGCGCAGC CTATATGCAG
GGTTCAGCGC AAGCTGACTC CGCCTTTTAC CTGCATCAGA TGCAGCAAAG CGCAGATGAT
AGCAAGACCA ACTGGCAATT ACTCGCCATT CATGCACTGC TGAAAGAAGG AAAAAGCCAG
CAGGCCGTCG ACCTGTTCAA CCAACTCCCG CAAAATCTGA ACGATACCCA GCGTCGCGAA
CAGTCTTTAT TAGCGGTAGA AATCAAACTG GCGCAAAAAG ATGTCGCAGG CGCGCAGGCC
TTGCTGGATA AACTAAAACC CGCCGACTTT GCGCCACATC AGCAAGCGCG TTACTGGCAG
GCGCAGATCG TTGCCAGCCA GGGACGCCCG TCGCTTACCC TGTTGCGGGC GTTAATCGCC
CAGGAACCGC TACTGGCGGC GAAAGATAAA CAAAAAAATA TCGACGCCAC CTGGCAGGCG
CTCTCCGCCA TGACGCCGGA TCAGGCCAGG ACGCTGGTTA TCAACGCCGA TGAAAATGTG
CTTCAGGGCT GGCTGGATCT GCAACGCGTC TGGTTTGACA ACCGCAACGA TCCGGACATG
CTGAAAGCCG GGATCGCCGA CTGGCAAAAA CGCTACCCGC AAAATCCGGG GGCGAAAATG
CTGCCGACGC AGCTCGTCAA TGTACAACGT TTCAAACCGG CTTCCACCAG CAAAATCGCT
CTGCTGCTGC CGCTGAACGG TCAGGCTGCC GTGTTTGGCC GTACCATCCA GCAAGGTTTC
GAAGCCGCGA AAAACCTCGG CACCCAGGCG GTAGAGATGC AGCCTGCCGC CGCGCCTGAC
GCGCCGGTAG AACCTGGCGT GGAGGAGACG CAGCCACAAA TGACCAACGG CGTCGCCAGT
CCGTCGCAGG CCTCGGTGAG CGATCTGACT GATGACGCTC CATCCCAGTC CGCTACGCCA
GTCAGCGCGC CACAAACTCC CCCTGCTACA GCAAGCGCGC CAGCGGATCC CTCCGCTGAA
TTAAAAATCT ACGATACCTC TTCCCAGCCG TTGGATCAGG TGCTTGCTCA GGTTCAGCAA
GACGGCGCCA GTATCGTGGT CGGGCCGCTG TTGAAAAACA ATGTGGAAGC GCTGATGAAA
AGCAACACGC CGCTCAACGT GCTGGCGCTC AACCAGCCGG AAACGGTACG TAGCTTCCCT
AATATCTGCT ATTTCGCGCT CTCTCCAGAA GATGAAGCCC GTGATGCGGC GCATCATATT
TATGACCAGG GCAAGCAGTC GCCGCTGCTG TTGATCCCAC GCAGCACGCT TGGCGATCGC
GTGGCGAACG CCTTCACCCA AGAGTGGCAA AAACTGGGCG GCGGCATCGT GTTACAGCAA
AAATTCGGCT CCGTAGCCGA GCTGAAAATG GGCGTGAACG GCGGCGCGGG TATCGCGTTG
ACGGGCAGCC CGGTCGCCGC CAGCGTGCCT GCGCAGCCTG GCGTCACCAT TGGCGGTCTG
ACTATCCCTG CGCCGCCGAC CGACGCGCAA ATCACCGGCG GCGGACGCGT AGACGCGGTC
TATATTCTGG CTACGCCGGA AGAGATTGGC TTTATCAAAC CGATGATCGC CATGCGTAAC
GGCACCCAGA GCGGCGCGAC GCTGTATGCC AGCTCTCGCA GCGCGCAAGG CACCTCCGGC
CCTGACTTCC GTCTGGAGAT GGAAGGTTTG CAATACAGTG AAATTCCCAT GCTGGCAGGC
GGCAATATGC CGTTGATGCA GCAGGCGCTG AGCGCTGTAC ATAACGACTA TTCTCTGGCG
CGGATGTACG CCATGGGCGT GGATGCCTGG ACGCTGGCGA ACCACTTTTC GCAGATGCGT
CAGGTGCAGG GGTTTGAGAT CAATGGTAAT ACCGGCGCAT TAACCGCCAG CCCGGATTGT
GTGATTAACA GGAAGTTATC ATGGCTCAAA TACCAGCAAG GGGAGATTGT TCCCGCCAGC
TAA
 
Protein sequence
MVPSTFSRLN AARALPVVLA ALLFAGCGTQ APDQSAAYMQ GSAQADSAFY LHQMQQSADD 
SKTNWQLLAI HALLKEGKSQ QAVDLFNQLP QNLNDTQRRE QSLLAVEIKL AQKDVAGAQA
LLDKLKPADF APHQQARYWQ AQIVASQGRP SLTLLRALIA QEPLLAAKDK QKNIDATWQA
LSAMTPDQAR TLVINADENV LQGWLDLQRV WFDNRNDPDM LKAGIADWQK RYPQNPGAKM
LPTQLVNVQR FKPASTSKIA LLLPLNGQAA VFGRTIQQGF EAAKNLGTQA VEMQPAAAPD
APVEPGVEET QPQMTNGVAS PSQASVSDLT DDAPSQSATP VSAPQTPPAT ASAPADPSAE
LKIYDTSSQP LDQVLAQVQQ DGASIVVGPL LKNNVEALMK SNTPLNVLAL NQPETVRSFP
NICYFALSPE DEARDAAHHI YDQGKQSPLL LIPRSTLGDR VANAFTQEWQ KLGGGIVLQQ
KFGSVAELKM GVNGGAGIAL TGSPVAASVP AQPGVTIGGL TIPAPPTDAQ ITGGGRVDAV
YILATPEEIG FIKPMIAMRN GTQSGATLYA SSRSAQGTSG PDFRLEMEGL QYSEIPMLAG
GNMPLMQQAL SAVHNDYSLA RMYAMGVDAW TLANHFSQMR QVQGFEINGN TGALTASPDC
VINRKLSWLK YQQGEIVPAS