Gene SAG1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1000 
Symbol 
ID1013804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1007535 
End bp1008806 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content33% 
IMG OID637316184 
Producthypothetical protein 
Protein accessionNP_688011 
Protein GI22537160 
COG category[S] Function unknown 
COG ID[COG4487] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0413832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAAA TCAAATGCCC TCATTGTGGA ACAGCTTTTG CCATCAACGA GTCTGAATAC 
CATCAATTAC TAGAACAAAT TCGCGGAGAT GCTTTTGACA AAGAAGTAAG TGAACGGTTG
GAAAAAGAAC GTCTAATATT AGGGGAGCAA GCAAAAAATC AATTACAGGA AGTTGTTGTA
GAAAAAGACA AGGAGATAGC TAAACTTCAG TACAAAGTCA AACAATTTCT TATAGAAAAA
GACAATCTTC TCAAAGACAA TGAGTACCAA CTCGCTGAGC AATTAAATCA AAAAGACATG
ATGCTTCGCG ACCTTGAAAA CCAAATCGAT AGACTACGTT TAGAGCATGA AAATAGCTTG
CAAGAGGCGC TAACAAAAGT CGAACGAGAA AGAGATGCAA TACAAAATCA GTTGCACATT
CAAGAAAAAG AAAAAGATTT AGCTTTAGCT TCAGTAAAAA GTGATTATGA AGTACAACTA
AAGGCAGCCA ATGAACAAGT AGAATTCTAT AAAAACTTCA AAGCTCAACA GTCTACTAAA
GCAGTAGGAG AAAGTTTAGA ACATTATGCT GAAACAGAAT TTAATAAAGT GCGACATTTG
GCCTTTCCTA ATGCTTATTT TGAGAAGGAC AATACATTAT CAAGTCGTGG CTCAAAGGGG
GACTTTATCT ATCGAGAAAA GGATGAAAAT GACCTTGAGT TTTTAAGTAT CATGTTTGAA
ATGAAAAATG AGTCTGATGA TACTATCAAG AAGCATAAAA ATGAAGATTT TTTCAAAGAA
TTAGATAAAG ATCGTCGTGA AAAATCTTGC GAATACGCAG TTTTAGTAAC TATGCTTGAA
GCAGACAATG ACTATTATAA TACTGGAATT GTTGATGTTA GTCACAAATA CCCTAAAATG
TACGTTATAC GTCCACAATT TTTTATCCAA TTAATTGGTA TTCTAAGAAA TGCAGCACTC
AATACCTTAA AATATAAACA AGAGCTTGCT TTGATGAAAG AACAAAATAT TGACATCACA
CATTTTGAAG AAGATTTAGA TATTTTCAAA AATGCATTTG CTAAAAATTA TAATTCTGCA
AGCAAAAATT TCCAGAAAGC AATCGATGAA ATAGATAAAT CTATTAAACG TATGGAAGCT
GTTAAGGCTG CTTTAACAAC GTCTGAAAAT CAACTACGTC TTGCAAATAA TAAATTAGAC
GATGTTTCTG TCAAGAAATT AACAAGAAAA AATCCAACAA TGAAAGCAAA ATTCGATGCT
CTAAAAGACT AA
 
Protein sequence
MNEIKCPHCG TAFAINESEY HQLLEQIRGD AFDKEVSERL EKERLILGEQ AKNQLQEVVV 
EKDKEIAKLQ YKVKQFLIEK DNLLKDNEYQ LAEQLNQKDM MLRDLENQID RLRLEHENSL
QEALTKVERE RDAIQNQLHI QEKEKDLALA SVKSDYEVQL KAANEQVEFY KNFKAQQSTK
AVGESLEHYA ETEFNKVRHL AFPNAYFEKD NTLSSRGSKG DFIYREKDEN DLEFLSIMFE
MKNESDDTIK KHKNEDFFKE LDKDRREKSC EYAVLVTMLE ADNDYYNTGI VDVSHKYPKM
YVIRPQFFIQ LIGILRNAAL NTLKYKQELA LMKEQNIDIT HFEEDLDIFK NAFAKNYNSA
SKNFQKAIDE IDKSIKRMEA VKAALTTSEN QLRLANNKLD DVSVKKLTRK NPTMKAKFDA
LKD