Gene Saro_0649 details


Gene Information

Locus tag: Saro_0649
Symbol:
ID: 3918074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 688371
End bp: 689768
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 62%
IMG OID: 640443380
Product: Phage major capsid protein, HK97
Protein accession: YP_495930
Protein GI: 87198673
COG category:
COG ID:
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
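
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions fit the reported numbers but are not stated in the record. The function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, includes_stop: bool = True) -> int:
    """Residue count for a CDS, assuming the length is a whole number of codons."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(688371, 689768)
print(length)                     # 1398
print(protein_length_aa(length))  # 465
```

Both values match the Gene Length and Protein Length fields of the record.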


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.296668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGA ACCTGATTGC TGCGGCCCTT GTGGTCGCGG TATCGCTGGT GCTCGCTCCG 
AGCGCCGCCT TTGCTGCCAC TTTCGCGCAG CAGATCCATC CGTCGCTGGA CATCAACGGT
GCTCTTGCCG CGATCGGCAT GCTGGCGGCT GTTGGCAGCA TCAGCGAGTT CGGCCGCAAG
CAGGCAGGCG AAGGTGAGGG TGAGCTCAAG CAGCTCGCGG TCGACCTGAA GTCTGCGACG
GACCAGGTAA AGACCTTCGC CGAGAAGGCG GATGCGGAAA TGAAGCGTCT CGGCACGGTC
ACCGAGGAAA CCAAGAAGAG CGCAGACGAA GCGCTCATCA AGATGAACGA AACGACCGCT
CGAATCGATG CGATCGAGCA GAAGCTCGCT CGCCGCGGCG AAGAGGGCGA GAAGCGCCGC
GCCAAGACGG CCGGCCAGGA AGTGACCGAG AGCGAGGAGT TCAAGGCTTG GCTCGGCGGC
AACCGCAAGA ACACGTTCAG CATGCAGGTG AAGGCGATCA TTTCGTCGCT CACCACCGAC
GCTGATGGCT CGGCGGGCGA CCTGATCGTT CCGCAGCGCC AGCCCGGTAT CATTGGCCTG
CCGCAGCGCC GCATGACGAT CCGCGACCTG CTCACGCCGG GCAACACCGG TTCGAACGCG
ATCCAGTACG TGAAGGAAAC CGGCTTCACC AACAACGCTG CCACCGTGAC TGAAACCGCC
GGCACGGCGA AGCCGCAGTC GGAGATCAAG TTCGACATCG TCACCAGCTC GGTCACGACG
ATCGCTCACT GGGTGCTTGC GACCAAGCAG ATCCTCGACG ACGTGCCGCA GCTGCGCTCA
TACATCGATG GCCGTCTGCG TTATGGTCTG GAGTACGTCG AAGAAGGGCA GCTGCTCAAC
GGTGGCGGCA CTGGCACCGA TCTCAACGGC ATCTACACCC AGGCAACGGC TTTCGCGGCG
CCAATCACCC CCACCGCCGC CGGCATGATG ACGAAGATCG ACATCATTCG TCTCGCCATT
CTTCAGGCAG CTCTCGCGGA ACTGCCGGCC AACGGCATCG TGATGCACCC CAGCGATTGG
GCTGACATCG AGCTGACCAA GACCGATGAT GGCGCTTACC TGTTCGCCAA TCCGCAGGGT
GGCAGCGAGG CCCGACTGTG GCGCCTGCCT GTCGTCGAAA CGCAGGCGAT GACCGTCGAC
AAGTTCCTTA CCGGAGCTTT CCAGATGGGT GCGCAGGTGT TCGATCGCGA AGAAGCCAAC
GTCGAGATCT CGACTGAGGA CAGCGACAAC TTCCGCAAGA ACCTGGTCAC CATTCGCGCC
GAGGAGCGTC TCGCGCTCGC GGTCTATCGG CCGGAAGCCT TCATCAAGGG CGACTTCAGC
GACGCGCTGG CACTCTGA
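
The gene sequence is printed in groups of ten bases, so it has to be joined before any statistic such as the GC content is recomputed. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAGAAGA ACCTGATTGC TGCGGCCCTT GTGGTCGCGG TATCGCTGGT GCTCGCTCCG"
seq = clean_sequence(first_row)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the full sequence block instead of `first_row` would give the value to compare with the GC content field.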
 
Protein sequence
MKKNLIAAAL VVAVSLVLAP SAAFAATFAQ QIHPSLDING ALAAIGMLAA VGSISEFGRK 
QAGEGEGELK QLAVDLKSAT DQVKTFAEKA DAEMKRLGTV TEETKKSADE ALIKMNETTA
RIDAIEQKLA RRGEEGEKRR AKTAGQEVTE SEEFKAWLGG NRKNTFSMQV KAIISSLTTD
ADGSAGDLIV PQRQPGIIGL PQRRMTIRDL LTPGNTGSNA IQYVKETGFT NNAATVTETA
GTAKPQSEIK FDIVTSSVTT IAHWVLATKQ ILDDVPQLRS YIDGRLRYGL EYVEEGQLLN
GGGTGTDLNG IYTQATAFAA PITPTAAGMM TKIDIIRLAI LQAALAELPA NGIVMHPSDW
ADIELTKTDD GAYLFANPQG GSEARLWRLP VVETQAMTVD KFLTGAFQMG AQVFDREEAN
VEISTEDSDN FRKNLVTIRA EERLALAVYR PEAFIKGDFS DALAL
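
The protein sequence can be checked against the gene sequence by translating the codons. The sketch below builds the codon table from the usual TCAG ordering; translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as starts. It does not handle alternative start codons (for example GTG, which would be read as M at the first position), which is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and the last row of the gene sequence from the record.
print(translate("ATGAAGAAGA ACCTGATTGC TGCGGCCCTT"))  # MKKNLIAAAL
print(translate("GACGCGCTGG CACTCTGA"))               # DALAL*
```

The two fragments reproduce the first ten and the last five residues of the protein sequence, followed by the stop codon.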