Gene Saro_0646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0646 
Symbol 
ID3918071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp684636 
End bp686372 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content62% 
IMG OID640443377 
Productphage terminase 
Protein accessionYP_495927 
Protein GI87198670 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0135346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGGC CAGTGTGGTC GACTGCCTGC CAGGATTGGC GGGAGCGCAT CGTGGCGCGC 
GAGAGCATCG CGCCATGCGG GCCGCTGTTC CCGGCGAAAG CCGCGGATGC GTTGGGGGTG
TTCAAGTCAC TGCAGGTCAC CGACCTGCCT AAAAGGGAGA ATGGGACCCA CCCGACGTTG
GGTGAAGTCT GCGACCAGTT CGTTTTTGAC CTGGTTGCGG CCATCTTCGG CGCCGAGGAT
CCGGAAACTG GCGAGCGGTT GATCAAGGAG TTCATGCTCC TGATCAGCAA GAAGAATGGC
AAGTCGATGA TCGCAGCGGG CATCATGGTG ACCGCGTTGA TCCTCAACTG GCGTCCGAAC
GCGATCCTGC AGATCCTTGC GCCCACGATC GAGGTTGCGA ACAACAGCTT TGAGCCTGCC
ATGGGGATGG TAAGAGCGGA CGCCGAGCTG GCGATTGTCC TGAAGGTTGT CGAGCATCAG
CGACAGATCA AGCACCTGAC GACGGGTGCG GTATTGCGGG TCATTGCCGC CGACAGCGAT
ACCGTAGCTG GCGGCAAGGC GGCGATGACG TTGATCGAAG AGCTCTGGCT CTTCGGGAAG
AAGGCCAAGG CAGCCGCGAT GTTGCGAGAG GCGCTGGGTG GCGGATCGGC AAGGCCGGAA
GGCTTTACGC TCTACATCAC GACCCATTCG GACGAACCAC CGGCGGGCGT GTTCAAGACC
AAGCTGTCGT ACTTCCGCGA TGTGAGGGAT GGTGAGATCG AGGATCCTGC TACCTTCCCG
ATGCTGTACG AGTGGCCGGA AGACCTGTTG GAGTGCGAGT CCTACCTCGA TCCAGAGTTC
TTCTACGTCA CGAATCCACA CGTTGGCCGA TCGGTCTCGA TTGAATGGCT CAAGTCGGAA
CTGCAGAAGG AACAGATCGG CGAAGGTGAA GGCCTTCAGA TCTTCCTGGC GAAGCATCTG
AACGTCGAGA TTGGTTTGCG GTTGCGGCGC GATCGTTGGG GCGGTGCTGA GCTTTGGCTC
GATGCAGCGA ATGATGACCT TGATCTCGAC CAGCTGCTTG AGCGCTGCGA AGTGGCGATC
GTTGGGCTCG ACATGGGCGG CCGGGACGAC TTGGCCGGTG CCGGCGTGGT CGGTCGCGAA
AAGGGAACGG GTATATGGCT GGGCTGGGCG CATGCCTGGG CGCAGCGGGT TGCGCTGGAG
CGGCGCAAGC AGGTGGCGCC GACGCTGCAA GGCTTTGCGG CTGAAGGCGA CCTGACCTTC
ACCGATTCCG GTGAGGAAAT CGTGAGCGCC ATGGCGCGCC TTGCAATTCG GGTCCGCGAC
AGCGGCAAGA TGCCTGCGGA TGGCGGGGTT GCGGTCGATG CCTGGGGCAT CGGTCCACTC
GTCGATGCGC TGGTGCAGGC CGGGTTCGAT CCTGGCGACG AGGCAATGAA GCGCGCGGGG
CATATCGCCT CGATCAGGCA GGGTGTTGGC CTGTCGAGCG CGATCTACAC GCTGGAATTC
AAGCTCGGCG ACGGGATGTT CCGTCACGAC GGTTCGAACA TGATGGCCTG GTGCGTGAGC
AACGCGCTGG TCAAGCTCAG GGGCAGTGCC TTGTACGTCG ACAAAGAGAC ATCAGGCGCG
GGCAAGATCG ACCCGTTCGT GGCGCTGCTC AATGCAGTGA AGCGTATGGA AGAGGGCCCG
GTGGCCGTGG CTGGCGGCGT CGATAGCTGG CTCGCCAGCT TGCGTGGTGC GGCGTGA
 
Protein sequence
MARPVWSTAC QDWRERIVAR ESIAPCGPLF PAKAADALGV FKSLQVTDLP KRENGTHPTL 
GEVCDQFVFD LVAAIFGAED PETGERLIKE FMLLISKKNG KSMIAAGIMV TALILNWRPN
AILQILAPTI EVANNSFEPA MGMVRADAEL AIVLKVVEHQ RQIKHLTTGA VLRVIAADSD
TVAGGKAAMT LIEELWLFGK KAKAAAMLRE ALGGGSARPE GFTLYITTHS DEPPAGVFKT
KLSYFRDVRD GEIEDPATFP MLYEWPEDLL ECESYLDPEF FYVTNPHVGR SVSIEWLKSE
LQKEQIGEGE GLQIFLAKHL NVEIGLRLRR DRWGGAELWL DAANDDLDLD QLLERCEVAI
VGLDMGGRDD LAGAGVVGRE KGTGIWLGWA HAWAQRVALE RRKQVAPTLQ GFAAEGDLTF
TDSGEEIVSA MARLAIRVRD SGKMPADGGV AVDAWGIGPL VDALVQAGFD PGDEAMKRAG
HIASIRQGVG LSSAIYTLEF KLGDGMFRHD GSNMMAWCVS NALVKLRGSA LYVDKETSGA
GKIDPFVALL NAVKRMEEGP VAVAGGVDSW LASLRGAA