Gene SAG1194 details


Gene Information

Locus tag: SAG1194
Symbol:
ID: 1014001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1199064
End bp: 1200254
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 34%
IMG OID: 637316379
Product: hypothetical protein
Protein accession: NP_688203
Protein GI: 22537352
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
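
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1199064, 1200254)
protein = expected_protein_length_aa(length)
print(length, protein)
```

With the values from this record, the sketch gives 1191 bp and 396 aa, matching the Gene Length and Protein Length fields.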


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCGAC CATCAGAAAA AGAATTTAAA AATAGTTTGT TTTTTAAATG GATCTTAAAT 
AATCAAGCAG TTATTGCTCT CATGATTACC TTTTTGGTAT TTTTAACGAT TTTTATTTTT
ACCAAAATCT CTTTTATGTT TAAACCTGTG TTTGATTTTC TTGCTGTGCT GATATTGCCG
CTTGTAATTT CTGGCTTGCT TTATTACCTA TTAAAACCTA TGGTTACATT TTTAGAGAAG
CGGGGAATTA AGCGTGTAAC AGCGATATTA TCAGTTTTTA CTATTATAAT CCTTCTGTTA
ATTTGGGCAA TGTCTAGTTT TATTCCCATG ATGAGTAATC AATTACGCCA TTTTATGGAA
GATCTCCCTT CATATGTGAA TAAAGTGCAA ATGGAAACAA GTTCGTTTAT AGATCACAAC
CCTTGGTTAA AATCTTATAA AGGGGAAATA TCGAGCATGT TATCTAATAT CAGTAGCCAA
GCGGTCTCTT ATGCTGAAAA ATTTTCAAAG AATATTTTAG ATTGGGCAGG AAATTTAGCT
AGTACAGTTG CACGTGTGAC AGTAGCAACA ATCATGGCTC CCTTTATTTT GTTTTATCTT
TTAAGAGATA GTCGCAACAT GAAGAATGGT TTCTTAATGG TTTTACCAAC CAAACTACGC
CAACCAACTG ATCGTATTTT GCGAGAAATG AATAGTCAAA TGTCAGGGTA TGTGCAAGGA
CAAATCATTG TTGCTATTAC TGTTGGTGTT ATTTTTTCAA TAATGTATAG TATTATAGGC
CTTAGATATG GCGTGACATT AGGGATTATT GCCGGTGTGT TAAATATGGT TCCCTATTTG
GGAAGTTTTG TCGCCCAAAT TCCAGTGTTT ATCTTAGCGC TTGTCGCAGG ACCTGTTATG
GTTGTTAAAG TTGCGATTGT TTTTGTTATT GAGCAAACTC TAGAAGGACG CTTTGTCTCA
CCCTTGGTTT TAGGTAATAA ACTTAGCATT CATCCAATTA CAATTATGTT TATTTTATTA
ACCTCTGGAG CGATGTTTGG TGTTTGGGGA GTATTCCTCA GTATTCCGAT TTATGCATCT
ATCAAAGTTG TTGTTAAAGA ATTGTTTGAT TGGTACAAAG CTGTCAGTGG GCTATATACA
GTAGATGTTG TTACTGAAGA AAGAAGTGAA GAAGTTAAAA ATGTTGAATA G
 
Protein sequence
MNRPSEKEFK NSLFFKWILN NQAVIALMIT FLVFLTIFIF TKISFMFKPV FDFLAVLILP 
LVISGLLYYL LKPMVTFLEK RGIKRVTAIL SVFTIIILLL IWAMSSFIPM MSNQLRHFME
DLPSYVNKVQ METSSFIDHN PWLKSYKGEI SSMLSNISSQ AVSYAEKFSK NILDWAGNLA
STVARVTVAT IMAPFILFYL LRDSRNMKNG FLMVLPTKLR QPTDRILREM NSQMSGYVQG
QIIVAITVGV IFSIMYSIIG LRYGVTLGII AGVLNMVPYL GSFVAQIPVF ILALVAGPVM
VVKVAIVFVI EQTLEGRFVS PLVLGNKLSI HPITIMFILL TSGAMFGVWG VFLSIPIYAS
IKVVVKELFD WYKAVSGLYT VDVVTEERSE EVKNVE
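
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the sequence is pasted in the 10-base grouped layout shown above, and it uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last three 10-base groups of the gene sequence above.
print(translate("ATGAATCGAC CATCAGAAAA AGAATTTAAA"))
print(translate("GAAGTTAAAA ATGTTGAATA G"))
```

Applied to the full gene sequence, `translate` is expected to reproduce the protein sequence listed above, and `gc_percent` gives a value to compare with the GC content field.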