Gene Smal_2500 details


Gene Information

Locus tag: Smal_2500
Symbol:
ID: 6476989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stenotrophomonas maltophilia R551-3
Kingdom: Bacteria
Replicon accession: NC_011071
Strand:
Start bp: 2806403
End bp: 2807653
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 65%
IMG OID: 642731686
Product: phage major capsid protein, HK97 family
Protein accession: YP_002028884
Protein GI: 194366274
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
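
The start, end, gene length and protein length values in this table agree with one another if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information table.
start_bp = 2806403
end_bp = 2807653

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS ends with a stop codon that is not translated, so the protein
# has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1251 0 416
```

The remainder of 0 shows the gene length is a whole number of codons, as expected for an unspliced CDS.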


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00454127
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGA TGACCCACGG CCGCGTCCCG CGCGGCCTCG TTTCCGTGCG CGCCGAAGGC 
GCAAACCAGC CCGACGTGAA GGCGCTGGTG GAGTCGCTGA ACAAGGCATT CGCCGACTTC
AAGGCCGAGC ACACCAAGCA GCTGGAAGAG ATCAAGAAGG GCAGCGCCGA TGCACTGCAG
GCCTTGAAGG TCGACAACAT TAATGCCGAC ATCACCCGCC TGCAGGCCGC GGTCGACCAG
GCCAACACCC AAATGGCCGC GTTCCAGATG GGCGGTGGTA GCGCCGGCAG CGGTGTCGCC
GACGCCGAGT ACACCGAGTC GTTCCGCGCC CACTTCCGAA AGGGTGAAGT GCAGGCGGCC
CTGAACAAGG GTGCTGCCGA TGAAGGTGGC TATCTGGCGC CGATCGAATG GGATCGTTCG
ATCACCGATC GCCTGGTCAT CGTGTCGGAT ATGCGGCAGT TGGCCAACGT GCAGCCCTGC
TCCGGCGCAG GCCTGACCAA GCTCTACAAC ACCGGCGGCA CTTCCTCGGG CTGGGTGGGC
GAAGAGGATC CGCGCCCGGA GACCGCGACT GCGAAGCTGC GCCCGCTCAG CTTCGGCTGG
GGTGAGATCT ACGCCAACCC GGCAGCGACC CAGCAGCTGC TGGACGATGC CGAGATTGAC
CTGGAGGCGT GGCTGGCCGG CGAGGTCGAG CTGGAGTTCG CCAAGCAGGA GGGCGATGCG
TTCTTCTCCG GCAATGGCGT CAACAAGCCG TTCGGCATCC TGACCTACGT GGACGGTGGC
GCCAACGCGG GCAAGCACCC GTTTGGTGCG ATCAAGGTGG TGAACAGCGG GCTGGCGGCC
GGCATCAACG GTGACAGCAT TCTGGACCTG GTCTATGACC TGCCGTCGGC ATTCACCGCG
GGCGCCAAGT TCGCGCTGAA CCGCAAGACC CAGGGTGTGG TGCGCAAGCT GAAGGATGCC
CAGGGCAACT ACCTGTGGCA GCCGTCGCTG GTGGCGGGTC AGCCGTCGAC CCTGGCCGGC
TTTGCGGTGC AGGACGTGGC TGCGATCCCG GACGTGGCAG CAAACGCCAT TGCCGCGCTG
TTCGGCGACT TCAAGCAGAC CTACACCGTG TACGACCGCA AGGGCGTACG CGTGCTGCGC
GACCCGTACA CCAACAAGCC CTACGTGATG TTCTACACCA CCAAGCGCGT GGGTGGCGGT
GTGCACAACC CGGAGCCGAT GCGCGCCCTC AAGATCGCGG CTTCGGCCTG A
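
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any calculation such as the GC content reported in the Gene Information table. The following is a rough Python sketch of one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of the database.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a small example.
# Paste the full sequence instead to compare against the reported GC content.
first_line = "ATGACCAAGA TGACCCACGG CCGCGTCCCG CGCGGCCTCG TTTCCGTGCG CGCCGAAGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```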
 
Protein sequence
MTKMTHGRVP RGLVSVRAEG ANQPDVKALV ESLNKAFADF KAEHTKQLEE IKKGSADALQ 
ALKVDNINAD ITRLQAAVDQ ANTQMAAFQM GGGSAGSGVA DAEYTESFRA HFRKGEVQAA
LNKGAADEGG YLAPIEWDRS ITDRLVIVSD MRQLANVQPC SGAGLTKLYN TGGTSSGWVG
EEDPRPETAT AKLRPLSFGW GEIYANPAAT QQLLDDAEID LEAWLAGEVE LEFAKQEGDA
FFSGNGVNKP FGILTYVDGG ANAGKHPFGA IKVVNSGLAA GINGDSILDL VYDLPSAFTA
GAKFALNRKT QGVVRKLKDA QGNYLWQPSL VAGQPSTLAG FAVQDVAAIP DVAANAIAAL
FGDFKQTYTV YDRKGVRVLR DPYTNKPYVM FYTTKRVGGG VHNPEPMRAL KIAASA
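
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the Gene Information table. The Python sketch below shows that relationship under some simplifying assumptions: it uses only unambiguous A/C/G/T bases, it stops at the first stop codon, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACCAAGA TGACCCACGG CCGCGTCCCG"))  # MTKMTHGRVP
```

Passing the full gene sequence to `translate` should, under the same assumptions, give a string that can be compared with the protein sequence once its spaces are removed.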