Gene Sala_2002 details


Gene Information

Locus tag: Sala_2002
Symbol:
ID: 4082167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2111214
End bp: 2112320
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 72%
IMG OID: 638010378
Product: Phage portal protein, HK97
Protein accession: YP_617046
Protein GI: 103487485
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
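
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that this record does not state: that Start bp and End bp are 1-based inclusive positions, and that the gene length includes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2111214
end_bp = 2112320
gene_length_bp = 1107
protein_length_aa = 368


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value against the value listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under those assumptions the listed gene length and protein length agree with the coordinates: 1107 bp is 369 codons, of which one is the stop codon.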


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.443505
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.004041
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACTGGT TTGGCCGGAA GGCTGCGCAG GGGGCTGCGC GGCCCGCTTT GTCGCGGGTG 
TATGGGAGCT GGTCGGCGCC TGCGCCGCTG TCGTGGGAAG CGCAGGTGCG CGAGGGGTAT
CTGGCGAATG CGATCGTGCA GCGCAGCGTG CGGCTGGTGG CCGAGGCGGC GGCGAGTGCG
CCGCTGGAGG CGAGCGATCC GGGGCTCTTG GCGCTGGTTT CGGCGACGTC GGGCGGGCAG
GGGCTGCTTG AGACGCTGGC GTCGCACCTG TTGCTGCACG GCAATGGCTA TGTGCAGATT
TTGACCGATG GCGCGGGGGC GCCGGCCGAG CTGTTCGCGC TGCGCCCCGA GCGGGTGACG
GTCGAGGCCG ACGCGCGCGG GTGGCCGGTG GCCTATCGCT ACAAGGCGGG CGGGTCGGCG
GCGGTCCTGC CCGCCGAGGA TGGCGCGGGG CGCGTCGCGG TGGTGCATGT GAAGGCGCTG
CATCCGCTCG ACGATCATTA TGGCGCGGGG TGCCTGGGCG CCGCGGCGGG GGCGATCGCG
GCGCATAATG CGGCGGCGAA GTGGAATGCG GCGCTGCTGG AGAATGCGGC GCGGCCGTCG
GGGGCGCTGG TCCATGATCC GGGCGACAAG GGGATGCCGC TGTCGGCCGA GCAGGTCGAG
CGGCTGCGCG AGGAACTGGC CGAGAGTTTT TCGGGGCGTG CCAATGCCGG GCGGCCCTTG
CTGCTGGAGG GTGGCCTCCG GTGGCAGGCG CTGTCGCTGT CGCCCGCCGA GATGGATTTC
CTGGCGCTGA AGGATTCGAG CGCGCGCGAG ATTGCGATGG CGTTCGGGGT GCCGCCGATG
CTGCTGGGGC TGCCGGGGGA CGCGACCTAT GCCAATTATC GCGAGGCCAA TCGCGCGCTG
TGGCGGCTGA CGGTGCTGCC TTTGTGCGCC AAGATATTGG GGGCGATCGC GCAGGGGCTG
TCGGGCTGGT TCGACGGCGC CGAGCTGCGC GTCGACCTCA ACAAGCTGCC CGCGCTGGCC
GAGGACCGGA TGGCGCTGTG GCGCGAGGTG TCGGGTGCCG ACTGGCTGAG CGCGGACGAG
AAGAAGGCGC TGCTGGGGGT GGCGTAG
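
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as a GC-content check. The following is a minimal sketch, not the method used by the source database; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed row is embedded here as sample input.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed row of the gene sequence, copied verbatim from this record.
first_row = "ATGAACTGGT TTGGCCGGAA GGCTGCGCAG GGGGCTGCGC GGCCCGCTTT GTCGCGGGTG"

seq = clean_sequence(first_row)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Pasting the full sequence in place of `first_row` would allow the result to be compared with the GC content and gene length listed in the record.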
 
Protein sequence
MNWFGRKAAQ GAARPALSRV YGSWSAPAPL SWEAQVREGY LANAIVQRSV RLVAEAAASA 
PLEASDPGLL ALVSATSGGQ GLLETLASHL LLHGNGYVQI LTDGAGAPAE LFALRPERVT
VEADARGWPV AYRYKAGGSA AVLPAEDGAG RVAVVHVKAL HPLDDHYGAG CLGAAAGAIA
AHNAAAKWNA ALLENAARPS GALVHDPGDK GMPLSAEQVE RLREELAESF SGRANAGRPL
LLEGGLRWQA LSLSPAEMDF LALKDSSARE IAMAFGVPPM LLGLPGDATY ANYREANRAL
WRLTVLPLCA KILGAIAQGL SGWFDGAELR VDLNKLPALA EDRMALWREV SGADWLSADE
KKALLGVA