Gene SAG1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1804 
Symbol 
ID1014613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1800413 
End bp1801402 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content37% 
IMG OID637316972 
Producthypothetical protein 
Protein accessionNP_688794 
Protein GI22537943 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTGATGGACA TGCTCATATT GTTGATACAA TTGCAGGATT TAATGGTAAA 
GGACGTCTTA ATGCCCTTGG AAACGGTTAT GCTATTTGGG ATGATGGTGA TTTAATTAAG
CTCATCCCAG ACGAATACGG TGACAAAGCA TTTACAGCTG AAGCTTTTTT GAAGTATATG
GATGCTAATG GTGTAGATAA AGCAGTCATT TTGCAAGGTC ATCTTAATGG TTATCAAAAT
TATTATACGC ATTTAGCTAT TAAACGTTAT CCAGAACGTT TTACAGGCGC ATTCTCTGTT
GATCCATTTG CAGATAACGC AATGCAAATT GTAAAACGTC ATGTTGAAGT CCTTGGTTTT
CGAGCTATTA AATTTGAAAT TAGTCAAGGA GGGGGTATCC ATGGATACCG TGGACAAAAA
ACGCCTTTCC GTCTTGATAC AGACCCACAT GTGAGTCGTA TTTTGACTTA TCTGTTAGAC
TATCCAGGAT TTGTAGTTAC AGTTGATTAT GGCAATTGGG ATCAAATTAG TCACCAACCT
GATGCTATTG CTAATTTAGC TCGCCTTTAT ACGAGTCTCG ATTTTGTTGT TTGTCACCTC
TCTTTTCCAC ATGTTGAGCA TTCAAATCGA TTGCGTGCTG AACTTAATAT GTGGAAAAAC
TTCAACAATA TCTATACTGA TATTTCAGCT ATTCAAGATA TTGATGCCCC AGATAGTTTT
CCTTTTCCAA AATCAGAACA AAATGTACGT ATTGCTAAAG AAGTTCTTGG GGCTAAGCGT
ATCATTTGGG GAACTGATTC TCCATGGTCA GCGACCTTCA ATACTTACGA AGAATTAGCT
ACTTGGCTAG AGAATGTTGA TATTTTCAGT CAAGAGGAAC TAGAAGATGT TATGTACAAT
AATGCTGAGC GTGTTTACTT TAAAGAAGAA CATGTAAAGG CTAACAGAGC AGGTATAGAT
GATACAACCA AAGAGCTCAA TCTCTATTAG
 
Protein sequence
MKKIDGHAHI VDTIAGFNGK GRLNALGNGY AIWDDGDLIK LIPDEYGDKA FTAEAFLKYM 
DANGVDKAVI LQGHLNGYQN YYTHLAIKRY PERFTGAFSV DPFADNAMQI VKRHVEVLGF
RAIKFEISQG GGIHGYRGQK TPFRLDTDPH VSRILTYLLD YPGFVVTVDY GNWDQISHQP
DAIANLARLY TSLDFVVCHL SFPHVEHSNR LRAELNMWKN FNNIYTDISA IQDIDAPDSF
PFPKSEQNVR IAKEVLGAKR IIWGTDSPWS ATFNTYEELA TWLENVDIFS QEELEDVMYN
NAERVYFKEE HVKANRAGID DTTKELNLY