Gene SAG1035 details

Gene Information

Locus tag: SAG1035
Symbol:
ID: 1013839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1042635
End bp: 1043909
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 33%
IMG OID: 637316218
Product: hypothetical protein
Protein accession: NP_688045
Protein GI: 22537194
COG category: [S] Function unknown
COG ID: [COG4499] Predicted membrane protein
TIGRFAM ID:
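
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 1042635
END_BP = 1043909


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS.

    Assumption: the last codon is a stop codon and is not counted.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1275 424
```

Both values agree with the Gene Length and Protein Length fields of the record.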


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACAG ATTCGTTTGT ATTTGAAGAA CAAGACTTTC AATTTGAAAA AGGACCAGAT 
TTTTGGCAGT TATCTCTTAA GCGTTCAAAA GTAGGAACTC AAAATCTGCA ACAACTGCAA
TTGTTAGAAA TTCACCATTC TTTGTTGATG CCTATGACAA CTGCAATTGA TGCAGATACC
ATTCAATTTC AATTCCAGAC AGAAGCGCAC GCTCTATTTT TTGAAAGCTT TAAAAAAGAA
ACTTTATCTG AAAAATTACG ATTAGCCTTG AATGTCTTGG ATTTAGATAA GGCTTTGTCT
TTGTCAGTAA ATTTCATTTT GCATCCCTCT AATCTATTTT TAACAAAAAA TGCAACTGCT
AAAATAGCTT ATCGTAGCCT TCCTGGGATT ATGAGACCTG AGAAATTTGG TCCAGAAGAG
TTTTTATATC AATTCAAATG TTTTGTCTTT GCATTATTGA CGCAACATGA CTATATAGAG
TTGTATAATG GTGCTATTTC TGTTATTGAA GTATCAGATT TCCTAAAAAG CATTTATCAT
GCAGAAACTA TTCAGGCTGT TAGAGACATC ATTACTATTG ATTATGAGCA GCAAGTAGAA
GTAGAAACTC ATACGTTAGC AAAAGTATCA AGGGCAAAAT ATAAGCTTTA TAAATACATA
AGTGTTTGGC TAGGAGCTTT ATCTACAATA CTCTTGATTC CGCTAGTATA TCTGGTTTTT
ATTCACAATC CATTCAAAGA AAAAATGTTG GCAGCGGATA CTTCATTTAT TAAAGTTGAT
TATAATCAAG TGATTAATCG ACTAGAACAT GTAAAAGTAA GTAAGTTACC TTATACACAG
AAGTACGAGT TGGCCTATTC CTATATTAAT GGAATGTCAT TTTCTGAAGA ACAGCGTGAA
GTTATTTTAA ACAATGTTAC GCTCAAGACT GATGAGCTCT ATCTTGATTA TTGGATTAAT
ATTGGCCGTG GTTTAGATGA TGATGCTATT GATGCTGCCA AACGTTTAGA CGATTCTGAC
CTTGTTATTT ATGCTATTGT TCAGAAAATG GATCAGGTTA GAAAAGACAA TAGTTTATCT
GGTAAAGATC GTGAACAAAA ACTTTCTGAG CTACAAACAG ACTATGATAA ATATTGGAAA
GACCGTAAGA CAGCCTTAAC AGATGAGGAA TCCAAATCGA AAAATAGCAA TAATCATTCG
ACAAATTCCA ACAAAGAGTC ATCTGAATCA TCAAGTACTA CAGCAAGTAC ATCTTCTAAA
ACTAAAAGTA GGTAG
 
Protein sequence
METDSFVFEE QDFQFEKGPD FWQLSLKRSK VGTQNLQQLQ LLEIHHSLLM PMTTAIDADT 
IQFQFQTEAH ALFFESFKKE TLSEKLRLAL NVLDLDKALS LSVNFILHPS NLFLTKNATA
KIAYRSLPGI MRPEKFGPEE FLYQFKCFVF ALLTQHDYIE LYNGAISVIE VSDFLKSIYH
AETIQAVRDI ITIDYEQQVE VETHTLAKVS RAKYKLYKYI SVWLGALSTI LLIPLVYLVF
IHNPFKEKML AADTSFIKVD YNQVINRLEH VKVSKLPYTQ KYELAYSYIN GMSFSEEQRE
VILNNVTLKT DELYLDYWIN IGRGLDDDAI DAAKRLDDSD LVIYAIVQKM DQVRKDNSLS
GKDREQKLSE LQTDYDKYWK DRKTALTDEE SKSKNSNNHS TNSNKESSES SSTTASTSSK
TKSR
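
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch and not the database's own pipeline. It assumes the sequence blocks are pasted in as plain strings with spaces and line breaks, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs mainly in the start codons it allows). The names `clean`, `translate` and `gc_fraction` are hypothetical helpers.

```python
# Standard codon assignments, enumerated in T, C, A, G order.
# Assumption: table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; a stop codon is written as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
first_block = "ATGGAAACAG ATTCGTTTGT ATTTGAAGAA"
last_block = "ACTAAAAGTA GGTAG"
print(translate(first_block))  # METDSFVFEE
print(translate(last_block))   # TKSR*
```

The two printed fragments match the start and end of the protein sequence, with the final TAG stop codon shown as `*`. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.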