Gene Saro_0647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0647 
Symbol 
ID3918072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp686369 
End bp687673 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content62% 
IMG OID640443378 
ProductPhage portal protein, HK97 
Protein accessionYP_495928 
Protein GI87198671 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.239357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCAGTC TGGCCCCGCG AACGTCCTGG CTCGCGAAGG CGACGACGGC GCTGCGCGAC 
TGGCTGGTGC TCGGACCGGA TCCGAAGATC CGCGACCCAC AGAATTCGTC GCGTGGGGGG
AACGGGGCGG GCGTAACCGT CAATGACCAG GCTGCGATGC GGCTGAGTGC ATTCTGGGGC
TGCGTCCGCC TCATCTCGTC TACGATCGGC TCGCTACCGG TCCCGGTCTA TACCGTCGAC
CAGCGCGGAG TTCGCTCGGT CGCTCGGGAG AGTGCACTGT ATCGTGTGTT GCACGATAGC
CCGAACGCTG ACCAGACGCC GGTCGATTAC ATGGAATGCG CCGTCATTTC GCTACTTTTG
CGTGGCAACC ACTACGCCCG CAAGCTGATG GAAGGTGGCC GACTGGTTGG CCTCGAGCCG
ATCAATCCCG CGATCGTCAG CGTCCGCCGG CGTTCCGATG GCAGGATTGG GTATCGTTGG
ACCGAAGGCG GTGAAAACTT CGACCTTACC GAGGATGAAG TTTTTCACGT TCGTGGATTC
GGCGGCGGGC CGCTGGGCGG GCTTTCGACT GTCGAGTTTG CACGTGAATC GCTGGGCGTA
GCGATCGCTG CGGACCGCGC CGCGAGCGCG ATTTTCGCCA ACGGGGTGAA CCCGACAGGA
ATCATGTCGA CTGATATGCC GCTGACGGCT GCGCAGCAGG CAGAGGCAGA GGAGTTGATC
GTCAAGAAGT ACCAGGGAGC GCACCGCATG GGTGTCCCGA TGGTGCTTGG CCACGGGTTG
AAGTGGAATT CAATCACGAT GAAGGCCGAC GACGCCCAGC TGCTGCAAAG CCGGGGTTGG
AGCGTAGAGG AGATTTGCAG GTGGTTCGGC GTTCCGCCGT TCATGATCGG TCACAACGAG
AAGACCACGA GTTGGGGTAC CGGCATCGAG CAGATGCTGC TGGGCTTCCA GAAATTTACT
CTCAATCCCT ACCTGCGACG CATTGAGCAG GCTGTGCGCA AGCAGCTGAT CACTCCGATC
GAGCGTGCCC GTGGTCTGAC CGCCGAATTC AATCTTGAAG GCCTCCTGCG GGCCGACAGC
GCGGGTCGCG CATCGTTCTA CGACAAGGCG CTCAAGTCGA AGTGGATGGT CATCAACGAA
GTCCGGGCAA AGGAGAACCT TGCGCCGGTG CCGTGGGGCG ATGAGCCGAT CGTGCAGCAG
CAGGACGTGC CGCTGTCCGA TCAGCTCGAT GCCCTCCGGG AAGCAATCAA GAACGCCCAG
GACGTGGCCG GGCTGTTCCA GAAGGGAAAC GCCAATGCAG CGTAA
 
Protein sequence
MSSLAPRTSW LAKATTALRD WLVLGPDPKI RDPQNSSRGG NGAGVTVNDQ AAMRLSAFWG 
CVRLISSTIG SLPVPVYTVD QRGVRSVARE SALYRVLHDS PNADQTPVDY MECAVISLLL
RGNHYARKLM EGGRLVGLEP INPAIVSVRR RSDGRIGYRW TEGGENFDLT EDEVFHVRGF
GGGPLGGLST VEFARESLGV AIAADRAASA IFANGVNPTG IMSTDMPLTA AQQAEAEELI
VKKYQGAHRM GVPMVLGHGL KWNSITMKAD DAQLLQSRGW SVEEICRWFG VPPFMIGHNE
KTTSWGTGIE QMLLGFQKFT LNPYLRRIEQ AVRKQLITPI ERARGLTAEF NLEGLLRADS
AGRASFYDKA LKSKWMVINE VRAKENLAPV PWGDEPIVQQ QDVPLSDQLD ALREAIKNAQ
DVAGLFQKGN ANAA