Gene Sala_1999 details

Gene Information

Locus tag: Sala_1999
Symbol:
ID: 4082164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2109026
End bp: 2110168
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 68%
IMG OID: 638010375
Product: phage major capsid protein, HK97
Protein accession: YP_617043
Protein GI: 103487482
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
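
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the gene record.
start_bp = 2109026
end_bp = 2110168

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints "1143 380"
```

Both results match the Gene Length and Protein Length fields of the record.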


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00431544
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAGTGG ATATGGAAGT GAAGGCCGAT GCGCTGGACG GCGCGTTCGA TGCGGTGCTG 
GCGGCGGAGG CCGTCGATGA GCTGAAGGCG TCGGTTGCGG CGCTGAAGGC ACAGGTCGAT
GCCCAGGCGG TCGCGGCGGC GCGGTTGCCG CTCGACGGGG CGAAGGCGGC CGATCCGGCG
CTGGACGCCT TTGTCGAACG TTATCTGCGG CGCGGGATCG ACGCCGGGGT GGAGATGAAG
AGCCTGTCGG GGGCGAGCGG AGCCGAGGGC GGCTATGCGG TGCCGCGCGA GATCGACACC
AGCATCGCCG CGACGCTGAA ATCGCTGTCG CCGATCCGCA GCATCGCGAC CGTGGTGCAG
ACAGGGACGA GCGGGTACCG GAAGCTGATC GCGACGGGCG CGACGGGCGC GGGCTGGGTC
GGCGAAAGCG ACGCGCGGCC CGAGACGGCG ACGCGCAGCT TTGCCGAGAT CGCGCCGCCG
TCGGGCGAGC TTTACGCCAA TCCGGCGGCG AGCCAGGCGA TGCTCGACGA TGCGATGTTC
AACGTCGAGG CCTGGCTGGC CGACGAGATC GGGCGCGAGT TCGCGGTCGC CGAAGGGGCG
GCGTTCGTGA CCGGCAACGG CACGAACCGG CCCAGGGGAT TCCTGACCTA TGCGACGAGC
GACGAGGGTG ACGGTGCGCG GCCGTTCGGC ACGTTGCAGC ATCTGGCGAC GGGCAGCGCG
GGCGCCTTTC CGGCGGTGAA CCCTGAGGAC AGACTGGTCG AGCTGGTCCA TGCGCTGAAA
GCTCCGTACC GGCAGGGCGC GGTGTGGGTG ATGAACAGCG ATACGCTGGC GCGCATCCGC
AAGTTCAAGA CGTCGGACGG CGCCTTCGTC TGGCAGCCGG GGCTGGTCGA GGGACAGGCG
GCGAGCCTGC TCGGCTATCC GGTCGTCGAG GCCGAGGACA TGCCCGATAT TGCGGCCGAC
AGCCTGTCGA TCGCCTTCGG CAATTTCCGC GCGGGCTATC TGATCGCCGA CCGCGGCGAG
ACGCGCATCC TGCGCGATCC GTTCAGCAAC AAGCCCTTCG TGCATTTCTA TGCAACCAAA
AGGGTCGGCG GCGCGATCAT CGATTCGCAG GCGATCAAGC TGATGAAATT CGCCGCCAGC
TGA
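
The GC content field of a record like this one can be recomputed from the gene sequence. The sketch below is one way to do it in Python. The function name `gc_content` is hypothetical, and the record does not say how its own figure was derived or rounded. Only the first 60-base block of the sequence is used as sample input; paste the full sequence to check the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First block of the gene sequence, copied from the record.
first_block = "ATGGAAGTGG ATATGGAAGT GAAGGCCGAT GCGCTGGACG GCGCGTTCGA TGCGGTGCTG"
print(f"{gc_content(first_block):.0%}")
```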
 
Protein sequence
MEVDMEVKAD ALDGAFDAVL AAEAVDELKA SVAALKAQVD AQAVAAARLP LDGAKAADPA 
LDAFVERYLR RGIDAGVEMK SLSGASGAEG GYAVPREIDT SIAATLKSLS PIRSIATVVQ
TGTSGYRKLI ATGATGAGWV GESDARPETA TRSFAEIAPP SGELYANPAA SQAMLDDAMF
NVEAWLADEI GREFAVAEGA AFVTGNGTNR PRGFLTYATS DEGDGARPFG TLQHLATGSA
GAFPAVNPED RLVELVHALK APYRQGAVWV MNSDTLARIR KFKTSDGAFV WQPGLVEGQA
ASLLGYPVVE AEDMPDIAAD SLSIAFGNFR AGYLIADRGE TRILRDPFSN KPFVHFYATK
RVGGAIIDSQ AIKLMKFAAS
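
The protein sequence can be checked against the gene sequence by translating codons in frame. The sketch below is a minimal Python translator that uses the standard codon-to-amino-acid assignments. Translation table 11 uses the same assignments and differs from the standard table only in which codons may act as start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical. Only the first 60 bases of the gene are used as sample input.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First block of the gene sequence, copied from the record.
first_block = "ATGGAAGTGG ATATGGAAGT GAAGGCCGAT GCGCTGGACG GCGCGTTCGA TGCGGTGCTG"
print(translate(first_block))  # prints "MEVDMEVKADALDGAFDAVL"
```

The printed residues match the first 20 residues of the protein sequence in this record.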