Gene SAG1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1943 
Symbol 
ID1014753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1931943 
End bp1932980 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content39% 
IMG OID637317110 
Producthypothetical protein 
Protein accessionNP_688931 
Protein GI22538080 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.643032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACAA CATTAAACTA CATTAAAACC CTGACATCAA TCCCTTCACC AACAGGGTTT 
ACCCAAACAA TCATGACCTA TATCATCAAA GAATTGGAAG CGTTTGGCTA CTCACCAATT
CGCACAAACA AGGGAGGCGT CATGGTTTCT CTAAAAGGAA AAAATGATAC TAAACATCGC
ATGATAACTG CTCACCTTGA TACACTTGGT GCTATGGTTA GGGCCATCAA ACCAGATGGT
CGGTTAAAAA TTGACCTTAT TGGTGGATAT ACATACAATG CCATTGAAGG AGAAAACTGT
ACTATACACC TCTCAAAAAA TGGTAAAGAA ATTTCTGGAA CTGCTCTTAT TCATCAAACT
AGTGTCCATG TTTACAAAGA CGCTGGAACT GCTGAACGTA ATCAAACAAA TATGGAAATT
CGTTTAGATG AGAAAGTAAC AACTGCTGAC GAAACACGTG CTTTAGGCAT CCAGGTCGGT
GATTTCATTT CATTTGATCC GCGTACAATC ATAACAGACA GCGGCTTTAT TAAATCACGT
TACCTAGATG ACAAGGTATC CGCTGGTATC CTAATGGAAC TTCTTTCTGT TTACAAGAAA
GAAGACATTC AACTTCCTTA TACTACTCAT TTCTACTTTA GTGCCTTTGA AGAGCTAGGA
CATGGAGCAA ATTCAAGCAT CCCAAATGAA ACTGTAGAAT ATCTAGCAGT TGATATGGGA
GCTATGGGAG ACGATCAAGA AACTGACGAA TATACTGTCT CTATCTGTGT TAAAGATGCT
TCTGGTCCTT ATCATTATGA ATTACGTCAA CATCTTGTTT CTCTAGCTGA AAACAATAAT
ATTCCTTATA AACTTGATAT TTATCCTTAT TATGGTAGTG ACGCCTCCGC TGCCATGCGT
GCTGGTGCGG AAGTTAAACA CGCGCTACTT GGTGCAGGTA TTGAATCTAG TCATTCTTAT
GAACGTACCC ATATCGATTC TATTCAAGCA ACTGAACTCT TAGTGGATGC CTATCTCAAA
AGCAATATGG TGGACTAA
 
Protein sequence
METTLNYIKT LTSIPSPTGF TQTIMTYIIK ELEAFGYSPI RTNKGGVMVS LKGKNDTKHR 
MITAHLDTLG AMVRAIKPDG RLKIDLIGGY TYNAIEGENC TIHLSKNGKE ISGTALIHQT
SVHVYKDAGT AERNQTNMEI RLDEKVTTAD ETRALGIQVG DFISFDPRTI ITDSGFIKSR
YLDDKVSAGI LMELLSVYKK EDIQLPYTTH FYFSAFEELG HGANSSIPNE TVEYLAVDMG
AMGDDQETDE YTVSICVKDA SGPYHYELRQ HLVSLAENNN IPYKLDIYPY YGSDASAAMR
AGAEVKHALL GAGIESSHSY ERTHIDSIQA TELLVDAYLK SNMVD